Abstract

Kernelization is a concept that enables the formal mathematical analysis of data reduction
through the framework of parameterized complexity. Intensive research into the Vertex Cover
problem has shown that there is a preprocessing algorithm which, given an instance $(G, k)$ of
Vertex Cover, outputs an equivalent instance $(G', k')$ in polynomial time with the guarantee
that $G'$ has at most $2k'$ vertices (and thus $O((k')^2)$ edges) with $k' \le k$. Using the
terminology of parameterized complexity we say that $k$-Vertex Cover has a kernel with $2k$
vertices. There is complexity-theoretic evidence that both $2k$ vertices and $\Theta(k^2)$ edges
are optimal for the kernel size. In this paper we consider the Vertex Cover problem with a
different parameter, the size $\mathrm{FVS}(G)$ of a minimum feedback vertex set for $G$. This
refined parameter is structurally smaller than the parameter $k$ associated to the vertex
covering number $\mathrm{vc}(G)$ since $\mathrm{FVS}(G) \le \mathrm{vc}(G)$ and the difference
can be arbitrarily large. We give a kernel for Vertex Cover with a number of vertices that is
cubic in $\mathrm{FVS}(G)$: an instance $(G, X, k)$ of Vertex Cover, where $X$ is a feedback
vertex set for $G$, can be transformed in polynomial time into an equivalent instance
$(G', X', k')$ such that $k' \le k$, $|X'| \le |X|$ and most importantly $|V(G')| \le 2k$ and
$|V(G')| \in O(|X'|^3)$. A similar result holds when the feedback vertex set $X$ is not given
along with the input. In sharp contrast we show that the Weighted Vertex Cover problem does
not have a polynomial kernel when parameterized by $\mathrm{FVS}(G)$ unless the polynomial
hierarchy collapses to the third level ($\mathrm{PH} = \Sigma_3^p$). Our work is one of the
first examples of research in kernelization using a non-standard parameter, and shows that this
approach can yield interesting computational insights. To obtain our results we make extensive
use of the combinatorial structure of independent sets in forests.
1 Introduction

The Vertex Cover problem is one of the six classic NP-complete problems discussed by
Garey and Johnson in their famous work on intractability [22, GT1], and has played an
important role in the development of parameterized algorithms [15, 28]. A parameterized
problem is a language $L \subseteq \Sigma^* \times \mathbb{N}$, and such a problem is (strongly
uniform) fixed-parameter tractable if membership of an instance $(x, k)$ can be decided in
$f(k)|x|^c$ time for some computable function $f$ and constant $c$. Since the structure of
Vertex Cover is so simple and elegant, it has proven to be an ideal testbed for new techniques
in the context of parameterized complexity. The problem is also highly relevant from a
practical point of view because of its role in bioinformatics [1] and other problem areas.

In this work we suggest a "refined parameterization" for the Vertex Cover problem 
using the feedback vertex number $\mathrm{FVS}(G)$ as the parameter, i.e. the size of a smallest vertex
set whose deletion turns G into a forest. We give upper bounds on the kernel size for 
the unweighted version of Vertex Cover under this parameterization, and also supply 
a conditional superpolynomial lower bound on the kernel size for the variant of Vertex 
Cover where each vertex has a non-negative integral weight. But before we state our results 
we shall first survey the current state of the art for the parameterized analysis of Vertex 
Cover. 

There has been an impressive series of ever-faster parameterized algorithms to solve
$k$-Vertex Cover, which led to the current-best algorithm by Chen et al. that can decide
whether a graph $G$ has a vertex cover of size $k$ in $O(1.2738^k + kn)$ time and polynomial
space [HI [Snj IHl [H]. The Vertex Cover problem has also played an important role in
the development of problem kernelization [24]. A kernelization algorithm (or kernel) is
a polynomial-time procedure that reduces an instance $(x, k)$ of a parameterized decision
problem to an equivalent instance $(x', k')$ such that $|x'|, k' \le f(k)$ for some computable
function $f$, which is the size of the kernel. We also use the term kernel to refer to the reduced
instance $(x', k')$.

The $k$-Vertex Cover problem admits a kernel with $2k$ vertices and $O(k^2)$ edges, which
has been a subject of repeated study [3 [51 EHl 121 HI] and experimentation [1, 13]. There is
some complexity-theoretic evidence that the size bounds for the kernel cannot be improved.
Since practically all reduction rules found to date are approximation-preserving [28], it
appears that a kernel with fewer than $2k$ vertices would yield a polynomial-time approximation
algorithm with a performance ratio smaller than 2, which would disprove the Unique Games
Conjecture [25]. A recent breakthrough result by Dell and Van Melkebeek [12] shows that
there is no polynomial kernel which can be encoded into $O(k^{2-\epsilon})$ bits for any
$\epsilon > 0$ unless the polynomial hierarchy collapses to the third level
($\mathrm{PH} = \Sigma_3^p$), which suggests that the current bound of $O(k^2)$ edges is tight
up to logarithmic factors.

This overview might suggest that there is little left to explore concerning kernelization for 
Vertex Cover, but this is far from true. All existing kernelization results for Vertex Cover
use the requested size k of the vertex cover as the parameter. But there is no reason why we 
should not consider structurally smaller parameters, to see if we can preprocess instances 
of Vertex Cover such that their final size is bounded by a function of such a smaller 
parameter, rather than by a function of the requested set size k. We study kernelization for 
the Vertex Cover problem using the feedback vertex number FVS(G) as the parameter. 
Since every vertex cover is also a feedback vertex set we find that $\mathrm{FVS}(G) \le \mathrm{vc}(G)$, which
shows that the feedback vertex number of a graph is a structurally smaller parameter than 
the vertex covering number: there are trees with arbitrarily large values of vc(G) for which 
FVS(G) = 0. We call our parameter "refined" since it is structurally smaller than the standard 
parameter for the Vertex Cover problem. 

Related Work. The idea of studying parameterized problems using alternative parameters
is not new (see e.g. [2H]), but was recently advocated by Fellows et al. (THl 1201 12H]
in the call to investigate the complexity ecology of parameters. The main idea behind this
program is to determine how different parameters affect the parameterized complexity of
a problem. Some recent results in this direction include FPT algorithms for graph layout
problems parameterized by the vertex cover number of the graph [21] and an algorithm to
decide isomorphism on graphs of bounded feedback vertex number [26]. We are aware of
only two applications of this idea to give polynomial kernels using alternative parameters.
Fellows et al. HOI HE] show that the problems Independent Set, Dominating Set and
Hamiltonian Circuit admit linear-vertex kernels on graphs $G$ when parameterized by
the maximum number of leaves in any spanning tree of $G$. Very recently Uhlmann and
Weller [31] gave a polynomial kernel for Two-Layer Planarization parameterized by the
feedback edge set number, which is a refined structural parameter for that problem since it
is smaller than the natural parameter.

Our Results. We believe that we are one of the first to present a polynomial problem 
kernel using a non-standard but practically relevant refined parameter. We study the following 
parameterized problem: 

fvs-Weighted Vertex Cover

Instance: A simple undirected graph $G$, a weight function $w : V(G) \to \mathbb{N}^+$, a
feedback vertex set $X \subseteq V(G)$ such that $G - X$ is a forest, an integer $k \ge 0$.
Parameter: The size $|X|$ of the feedback vertex set.
Question: Is there a vertex cover $C$ of $G$ such that $\sum_{v \in C} w(v) \le k$?

We also consider the unweighted variant fvs-Vertex Cover in which all vertices have a
weight of 1. The problems fvs-Weighted Independent Set and fvs-Independent Set
are defined similarly. Throughout this work k will always represent the total size or weight 
of the set we are looking for; depending on the context this is either a vertex cover or an 
independent set. 

We prove that fvs-Vertex Cover has a kernel in which the number of vertices is bounded
by $\min(O(|X|^3), 2k)$. This bound is at least as small as the current-best Vertex Cover
kernel, but for graphs with small feedback vertex sets our bound is significantly smaller. We
also study the weighted version of the problem, and obtain a contrasting result: we show that
fvs-Weighted Vertex Cover does not admit a polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$. This is
very surprising since both the weighted and unweighted versions of $k$-Vertex Cover admit
polynomial kernels and can be attacked using similar reduction rules [10]. To our knowledge
we give the first example of a parameterized problem whose weighted and unweighted versions
are both NP-complete and FPT, but for which the unweighted version allows a polynomial
kernel but the weighted version does not.

When we present our results we will state them in terms of fvs-Independent Set and
fvs-Weighted Independent Set since this simplifies the exposition. Because we are using
the size of a feedback vertex set as the parameter, there are trivial parameterized reductions
between these problems: an instance $(G, X, k)$ of fvs-Vertex Cover is equivalent to an
instance $(G, X, |V(G)| - k)$ of fvs-Independent Set with the same parameter value $|X|$.
Hence our kernelization bounds for Independent Set will immediately carry over to Vertex
Cover.
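The equivalence between the two instances rests on the fact that a set is a vertex cover exactly when its complement is independent. A minimal brute-force sketch, only meant for tiny graphs, that lets one check this correspondence; the function names and the edge-list representation are illustrative assumptions, not part of the paper:

```python
from itertools import combinations

def is_vertex_cover(edges, cover):
    """Every edge has at least one endpoint in `cover`."""
    return all(u in cover or v in cover for u, v in edges)

def is_independent(edges, chosen):
    """No edge has both endpoints in `chosen`."""
    return not any(u in chosen and v in chosen for u, v in edges)

def has_vertex_cover(vertices, edges, k):
    """Brute force: is there a vertex cover of size at most k?"""
    return any(is_vertex_cover(edges, set(c))
               for size in range(min(k, len(vertices)) + 1)
               for c in combinations(vertices, size))

def has_independent_set(vertices, edges, k):
    """Brute force: is there an independent set of size at least k?"""
    return any(is_independent(edges, set(c))
               for size in range(max(k, 0), len(vertices) + 1)
               for c in combinations(vertices, size))
```

For any graph and any $k$, `has_vertex_cover(V, E, k)` and `has_independent_set(V, E, len(V) - k)` should agree, which is the reduction used in the text; the feedback vertex set $X$ plays no role in it and is passed along unchanged.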

2 Preliminaries

(a) Graph $G$ and its independence decomposition. (b) Decomposition for graph $G / \{\{u,v\}\}$. (c) Decomposition for graph $G - \{u,v\}$.

Figure 1 Examples of the independence decomposition of a graph. Black vertices are in $A$, white
vertices are in $N$, gray vertices are in $S$ and the edges in $M$ are drawn with thick lines.

In this work we only consider undirected, finite, simple graphs. Let $G$ be a graph and denote
its vertex set by $V(G)$ and the edge set by $E(G)$. We denote the independence number of $G$
by $\alpha(G)$, the vertex covering number by $\mathrm{vc}(G)$ and the feedback vertex number by
$\mathrm{FVS}(G)$. We will abbreviate maximum independent set as MIS, and feedback vertex set
as FVS. For $v \in V(G)$ we denote the open and closed neighborhoods of $v$ by $N_G(v)$ and
$N_G[v]$, respectively. For a set $S \subseteq V(G)$ we have
$N_G(S) := \bigcup_{v \in S} N_G(v) \setminus S$, and $N_G[S] := \bigcup_{v \in S} N_G[v]$. We
write $G' \subseteq G$ if $G'$ is a subgraph of $G$. The graph $G[V(G) \setminus X]$ obtained
from $G$ by deleting the vertices in $X$ and their incident edges is denoted by $G - X$. The
graph $G[E(G) \setminus Y]$ obtained from $G$ by deleting the edges in $Y$ but not their
endpoints is denoted by $G / Y$. Carefully observe the difference between these two operators:
if $\{u, v\}$ is an edge in $G$, then $G - \{u, v\}$ is the graph obtained from $G$ by deleting
the vertices $u, v$ and their incident edges, whereas $G / \{\{u,v\}\}$ is the graph obtained
from $G$ by removing the edge $\{u,v\}$ while leaving the endpoints $u$ and $v$ intact.
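The difference between the two deletion operators can be made concrete with a small sketch. The representation (a vertex set plus a set of `frozenset` edges) and the function names are assumptions chosen for illustration:

```python
def delete_vertices(vertices, edges, removed):
    """G - X: drop the vertices in `removed` and every edge touching them."""
    removed = set(removed)
    kept_vertices = set(vertices) - removed
    kept_edges = {e for e in edges if not (e & removed)}
    return kept_vertices, kept_edges

def delete_edges(vertices, edges, removed_edges):
    """G / Y: drop the edges in `removed_edges` but keep all endpoints."""
    return set(vertices), set(edges) - set(removed_edges)
```

On the path a, b, c, deleting the vertices `{"a", "b"}` leaves only the isolated vertex `c`, while deleting the single edge `frozenset({"a", "b"})` keeps all three vertices and the edge between `b` and `c`.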

We need the following proposition on the structure of maximum independent sets in trees
by Zito [32, Theorem 2], which we re-state here in terms of forests:

► Proposition 1. Let $F$ be a forest. Then there is a unique partition of the vertex set $V(F)$
into subsets $A, N, S$ such that:

1. Any MIS for $F$ contains all vertices of $A$ and no vertices of $N$.

2. For each vertex $v \in S$ there is a MIS for $F$ containing $v$ and a MIS for $F$ avoiding $v$.

3. There is a perfect matching $M$ in $F[S]$, and any MIS for $F$ contains exactly one endpoint
of each edge in $M$.

4. The matching $M$ contains all the $\alpha$-critical edges of $F$: for all $e \in E(F)$ it holds that
$\alpha(F) < \alpha(F / \{e\}) \Leftrightarrow e \in M$.

This partition is uniquely characterized by adjacency relations. The sets A, N, S form the 
described partition if and only if: 

I. There is a matching M on the vertices of S. 

II. No vertex of A is adjacent to another vertex of A or to a vertex in S. 
III. Each vertex of N is adjacent to at least two vertices of A. 

We will refer to this decomposition of the vertex set of a forest $F$ into the subsets $A, N, S$
with the matching $M$ as its independence decomposition (Figure 1).
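For very small forests the sets $A$, $N$ and $S$ can be read off directly from items 1 and 2 of Proposition 1 by enumerating all maximum independent sets. The following is a brute-force sketch under that reading (exponential time, with illustrative function names); it is not the efficient characterization via adjacency rules and it does not compute the matching $M$:

```python
from itertools import combinations

def maximum_independent_sets(vertices, edges):
    """Return all maximum independent sets of a small graph by brute force."""
    vertices = list(vertices)
    for size in range(len(vertices), -1, -1):
        found = [set(c) for c in combinations(vertices, size)
                 if not any(u in c and v in c for u, v in edges)]
        if found:
            # The empty set is always independent, so this is always reached.
            return found

def independence_decomposition(vertices, edges):
    """Split V(F) into (A, N, S): in every MIS, in no MIS, in some but not all."""
    all_mis = maximum_independent_sets(vertices, edges)
    a = set.intersection(*all_mis)
    s = set.union(*all_mis) - a
    n = set(vertices) - a - s
    return a, n, s
```

On a path with three vertices the middle vertex ends up in $N$ and both endpoints in $A$; on a path with four vertices every vertex lands in $S$, matching the perfect matching formed by the first and last edge.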

► Observation 1. Let $G$ be a graph. If $G'$ is a vertex-induced subgraph of $G$ then $\alpha(G) \ge \alpha(G')$,
so for all $W \subseteq V(G)$ we have $\alpha(G) \ge \alpha(G - W)$. If $G''$ is an edge-induced subgraph of $G$
then $\alpha(G'') \ge \alpha(G)$, so for all $Z \subseteq E(G)$ we have $\alpha(G) \le \alpha(G / Z)$.

► Observation 2. If $G$ is a graph and $v$ is a vertex in $G$ such that $\deg_G(v) \le 1$ then there is
a MIS for $G$ that contains $v$.
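Observation 2 yields the standard greedy procedure for forests: repeatedly take a vertex of degree at most 1 into the independent set and delete it together with its neighbor. A minimal sketch of that procedure, assuming the input really is a forest (the function name and data layout are illustrative):

```python
def forest_mis(vertices, edges):
    """Return a maximum independent set of a forest using the leaf rule."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    alive = set(vertices)
    chosen = set()
    # Candidates of degree <= 1; entries may become stale and are re-checked.
    stack = [v for v in alive if len(adj[v]) <= 1]
    while stack:
        v = stack.pop()
        if v not in alive or len(adj[v]) > 1:
            continue
        chosen.add(v)
        removed = {v} | adj[v]          # v and its (at most one) live neighbor
        alive -= removed
        for r in removed:
            for nb in adj[r]:
                adj[nb].discard(r)
                if nb in alive and len(adj[nb]) <= 1:
                    stack.append(nb)
    return chosen
```

Since every nonempty forest has a vertex of degree at most 1, the loop only stops once all vertices are removed, and `len(forest_mis(V, E))` equals $\alpha(F)$. This is the reason quantities such as $\alpha(F - N_G(X'))$ used later are polynomial-time computable.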



3 Cubic Kernel for fvs-Independent Set

In this section we develop a cubic kernel for fvs-Independent Set. Consider an instance 
(G, X, k) of the problem, which asks whether graph G with the FVS X has an independent 
set of size $k$. Throughout this section $F := G - X$ denotes the forest obtained by deleting
the vertices in X. Our starting point is the current-best Vertex Cover kernelization |H1 
Theorem 2.2] which exploits a theorem by Nemhauser and Trotter [Tl\ . 
► Theorem 1. There is a polynomial-time algorithm that takes an instance $(G, k)$ of Vertex
Cover as input, and computes an equivalent instance $(G', k')$ such that $G'$
is a vertex-induced subgraph of $G$ with $k' \le k$, $|V(G')| - k' \le |V(G)| - k$ and $|V(G')| \le 2k'$.
We can ensure that G' does not contain any vertices of degree < 1. 

Through the correspondence between Vertex Cover and Independent Set we can use
Theorem 1 to preprocess an instance $(G, X, k)$ of fvs-Independent Set.

► Reduction Rule 1. Let $(G, X, k)$ be the current instance of fvs-Independent Set. Run
the algorithm from Theorem 1 on the Vertex Cover instance $(G, |V(G)| - k)$ and let the
result be $(G', |V(G')| - k')$. Obtain $X'$ from $X$ by deleting the vertices that were removed
from $G$ by the algorithm, and use $(G', X', k')$ as the new instance of fvs-Independent Set.

When given an independent subset $X' \subseteq X$ of the feedback vertices we can efficiently
compute the largest independent set $I$ in $G$ which satisfies $I \cap X = X'$: since such a set
intersects $X$ exactly in $X'$, and since it cannot use any neighbors of $X'$, the maximum size
is $|X'| + \alpha(F - N_G(X'))$ and this is polynomial-time computable since $F - N_G(X')$ is a
forest. We can exploit this to assess which vertices from the FVS $X$ might occur in a MIS
of $G$.

► Definition 2. The number of conflicts $\mathrm{Conf}_{F'}(X')$ induced by a subset $X' \subseteq X$ on a
subforest $F' \subseteq F \subseteq G$ is defined as $\mathrm{Conf}_{F'}(X') := \alpha(F') - \alpha(F' - N_G(X'))$.

This term $\mathrm{Conf}_{F'}(X')$ can be interpreted as follows. Choosing vertices from $X'$ in an
independent set will prevent all their neighbors in the subforest $F'$ from being part of the
same independent set; hence if we fix some choice of vertices in $X'$, then the number of
vertices from $F'$ we can add to this set (while maintaining independence) might be smaller
than the independence number of $F'$. The term $\mathrm{Conf}_{F'}(X')$ measures the difference between
the two: informally it is the price we pay in the forest $F'$ for choosing the vertices $X'$ in the
independent set. We can now formulate our first new reduction rule.
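The number of conflicts from Definition 2 is straightforward to compute once $\alpha$ can be evaluated on forests. The sketch below uses a simple (quadratic, unoptimized) leaf-removal routine for $\alpha$; the function names and the tuple-based edge lists are assumptions made for this illustration:

```python
def alpha_forest(vertices, edges):
    """Independence number of the forest induced on `vertices` (leaf rule)."""
    vertices = set(vertices)
    edges = {frozenset(e) for e in edges if set(e) <= vertices}
    count = 0
    while vertices:
        degree = {v: 0 for v in vertices}
        for e in edges:
            for v in e:
                degree[v] += 1
        # A nonempty forest always has a vertex of degree at most 1.
        leaf = next(v for v in vertices if degree[v] <= 1)
        closed = {leaf} | {w for e in edges if leaf in e for w in e}
        vertices -= closed
        edges = {e for e in edges if not (e & closed)}
        count += 1
    return count

def conflicts(forest_vertices, forest_edges, graph_edges, x_subset):
    """Conf_{F'}(X') = alpha(F') - alpha(F' - N_G(X'))."""
    x_subset = set(x_subset)
    blocked = {w for e in graph_edges for w in e
               if set(e) & x_subset and w not in x_subset}
    remaining = set(forest_vertices) - blocked
    return (alpha_forest(forest_vertices, forest_edges)
            - alpha_forest(remaining, forest_edges))
```

As an example, let the forest be the path a, b, c and let the feedback vertex x be adjacent to a and c while y is adjacent to b. Choosing x blocks both endpoints of the path and costs one forest vertex, choosing y costs nothing, and choosing both costs two.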

► Reduction Rule 2. If there is a vertex $v \in X$ such that $\mathrm{Conf}_F(\{v\}) \ge |X|$, then delete $v$
from the graph $G$ and from the set $X$.

► Reduction Rule 3. If there are distinct vertices $u, v \in X$ with $\{u,v\} \notin E(G)$ for which
$\mathrm{Conf}_F(\{u, v\}) \ge |X|$, then add the edge $\{u,v\}$ to $G$.
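Rules 2 and 3 only change $X$ and the edges incident to $X$, so the forest $F$ and the conflict values of the remaining subsets stay the same; what does change is the threshold $|X|$ after a deletion, so the rules have to be re-applied until nothing triggers. A sketch of that loop, assuming the conflict values are supplied by a hypothetical callable `conf` that maps a `frozenset` of feedback vertices to $\mathrm{Conf}_F$ of that set:

```python
from itertools import combinations

def apply_rules_2_and_3(x, edges, conf):
    """Exhaustively apply Rules 2 and 3; returns the new X and edge set."""
    x = set(x)
    edges = {frozenset(e) for e in edges}
    changed = True
    while changed:
        changed = False
        # Rule 2: a single feedback vertex with too many conflicts is deleted.
        for v in sorted(x):
            if conf(frozenset([v])) >= len(x):
                x.remove(v)
                edges = {e for e in edges if v not in e}
                changed = True
                break
        if changed:
            continue  # |X| shrank, so re-check Rule 2 with the new threshold
        # Rule 3: a non-adjacent pair with too many conflicts becomes an edge.
        for u, v in combinations(sorted(x), 2):
            pair = frozenset([u, v])
            if pair not in edges and conf(pair) >= len(x):
                edges.add(pair)
                changed = True
    return x, edges
```

The order in which the rules are tried here is a design choice of the sketch, not something prescribed by the text; any exhaustive application order leads to an instance on which neither rule applies.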

Correctness of these two rules can be established from the following lemma. 

► Lemma 3. If $X' \subseteq X$ is a subset of feedback vertices such that $\mathrm{Conf}_F(X') \ge |X|$ then
there is a MIS for $G$ that does not contain all vertices of $X'$.

Proof. Assume that $I \subseteq V(G)$ is an independent set containing all vertices of $X'$. We will
prove that there is an independent set $I'$ which is disjoint from $X'$ with $|I'| \ge |I|$. Since
$\mathrm{Conf}_F(X') \ge |X|$ it follows by definition that $\alpha(F) - \alpha(F - N_G(X')) \ge |X|$; since $I$ cannot
contain any neighbors of vertices in $X'$ we know that $|I \cap V(F)| \le \alpha(F - N_G(X'))$, and
since $|V(G)| = |X| + |V(F)|$ we have $|I| \le |X| + \alpha(F - N_G(X')) \le \alpha(F)$. Hence the maximum
independent set for $F$, which does not contain any vertices of $X'$, is at least as large as $I$; this
proves that for every independent set containing $X'$ there is another independent set which
is at least as large and avoids the vertices of $X'$. Therefore there is a MIS for $G$ avoiding at
least one vertex of $X'$. ◀

► Reduction Rule 4. If $F$ contains a connected component $T$ (which must be a tree) such that
for all $X' \subseteq X$ with $|X'| \le 2$ for which $X'$ is independent in $G$ it holds that $\mathrm{Conf}_T(X') = 0$,
then delete $T$ from graph $G$ and decrease $k$ by $\alpha(T)$.
To prove the correctness of Rule 4 we need the following lemma.

► Lemma 4. Let $T$ be a connected component of $F$ and let $X_I \subseteq X$ be an independent set in $G$.
If $\mathrm{Conf}_T(X_I) > 0$ then there is a set $X' \subseteq X_I$ with $|X'| \le 2$ such that $\mathrm{Conf}_T(X') > 0$.

Proof. Assume the conditions stated in the lemma hold. Consider the independence
decomposition of $T$ into the sets $A, N, S$, and let $M$ be a perfect matching on $T[S]$. We will try
to construct a MIS $I$ for $T$ that does not use any vertices in $N_G(X_I)$; this must then also
be a MIS for $T - N_G(X_I)$ of the same size. By the assumption that $\mathrm{Conf}_T(X_I) > 0$ any
independent set in $T$ must use at least one vertex in $N_G(X_I)$ in order to be maximum, hence
our construction procedure must fail somewhere; the place where it fails will provide us with
a set $X'$ as required by the statement of the lemma.

Construction of a MIS. By Proposition 1 any MIS for $T$ must use all vertices in $A$,
no vertices from $N$ and exactly one endpoint of each edge in the matching $M$. It follows
that if some vertex $v \in A$ is adjacent in $G$ to a vertex $x \in X_I$, then $\alpha(T - \{v\}) < \alpha(T)$
and therefore $\alpha(T - N_G(x)) < \alpha(T)$, which proves that $\mathrm{Conf}_T(\{x\}) > 0$; hence we can then
use $X' := \{x\}$ as our desired subset to prove the claim. In the remainder of the proof we
may therefore assume that no vertex of $A$ is adjacent in $G$ to a vertex in $X_I$.

We now start building our independent set $I$ for $T$ that avoids vertices in $N_G(X_I)$. We
start by taking all vertices of $A$ in the independent set; we do not use any vertices in $N_G(X_I)$
here since $A \cap N_G(X_I) = \emptyset$ by assumption. To augment $I$ into a MIS for $T$ it remains to
add one endpoint of each edge in the matching $M$. Since the endpoints of the matching are
not adjacent to vertices in $A$ by the adjacency rules of Proposition 1 we can now restrict
ourselves to the subgraph $T' := T[S]$ induced by the matched vertices, since no choice of
independent vertices in $T[S]$ will conflict with the choice of the vertices $A$. If there is a
vertex $v$ in $T'$ such that $N_{T'}(v) = \{u\}$ and $N_G(v) \cap X_I = \emptyset$, then the edge $\{v, u\}$ must be
in the matching $M$ (since $M$ is a perfect matching in $T[S]$). Because we must choose one
of $\{u, v\}$ in a MIS for $T$, and by Observation 2 choosing a degree-1 vertex will never conflict
with choices that are made later on, we can add $v$ to our independent set $I$ while respecting
the invariant that no vertex in $I$ is adjacent in $G$ to a vertex in $X_I$. Since we have then
chosen one endpoint of the matching edge $\{u, v\}$ in $I$, we can delete $u, v$ and their incident
edges to obtain a smaller graph $T'$ (which again contains a perfect submatching of $M$) in
which we continue the process. As long as there is a vertex with degree 1 in $T'$ that has no
neighbors in $X_I$ we take it into $I$, delete it and its neighbor, and continue. If this process
ends with an empty graph, then by Proposition 1 the set $I$ must be a MIS for $T$, and since
it does not use any vertices adjacent to $X_I$ it must also be a MIS for $T - N_G(X_I)$; but this
proves that $\alpha(T) = \alpha(T - N_G(X_I))$ which means $\mathrm{Conf}_T(X_I) = 0$, which is a contradiction
to the assumption at the start of the proof. So the process must end with a non-empty
graph $T' \subseteq T$ such that vertices with degree 1 in $T'$ are adjacent in $G$ to a vertex in $X_I$
and for which the matching $M$ restricted to $T'$ is a perfect matching on $T'$. We use this
subgraph $T'$ to obtain a set $X'$ as desired.

Using the subgraph to prove the claim. Consider a vertex $v_0$ in $T'$ with $\deg_{T'}(v_0) = 1$,
and construct a path $P = (v_0, v_1, \ldots, v_{2p+1})$ by following edges of $T'$ that are alternatingly
in and out of the matching $M$, until arriving at a degree-1 vertex whose only neighbor was
already visited. Since $T'$ is acyclic, $M$ restricted to $T'$ is a perfect matching on $T'$ and
we start the process at a vertex of degree 1, it is easy to verify that there must be such a
path $P$ (there can be many; any arbitrary such path will suffice), that $P$ must contain an
even number of vertices, that the first and last vertex on $P$ have degree 1 in $T'$ and that
the edges $\{v_{2i}, v_{2i+1}\}$ must be in $M$ for all $0 \le i \le p$. Since we assumed that all degree-1
vertices in $T'$ are adjacent in $G$ to $X_I$, there exist vertices $x_1, x_2 \in X_I$ such that $v_0 \in N_G(x_1)$
and $v_{2p+1} \in N_G(x_2)$. We now claim that $X' := \{x_1, x_2\}$ satisfies the requirements of the
statement of the lemma, i.e. that $\mathrm{Conf}_T(\{x_1, x_2\}) > 0$. This fact is witnessed by considering
the path $P$ in the original tree $T$. Any MIS for $T$ which avoids $N_G(\{x_1, x_2\})$ must use one
endpoint of the matched edge $\{v_0, v_1\}$, and since the choice of $v_0$ is blocked because $v_0$ is a
neighbor of $x_1$, it must use $v_1$. But path $P$ shows that $v_1$ is adjacent in $T$ to $v_2$, and hence
we cannot choose $v_2$ in the independent set. But since $\{v_2, v_3\}$ is again a matched edge, we
must use one of its endpoints; hence we must use $v_3$. Repeating this argument shows that
we must use vertex $v_{2p+1}$ in a MIS for $T$ if we cannot use $v_0$; but the use of $v_{2p+1}$ is also
not possible if we exclude $N_G(\{x_1, x_2\})$. Hence we cannot make a MIS for $T$ without using
vertices in $N_G(\{x_1, x_2\})$, which proves that $\alpha(T) > \alpha(T - N_G(\{x_1, x_2\}))$. By the definition of
conflicts this proves that $\mathrm{Conf}_T(X') > 0$ for $X' = \{x_1, x_2\}$, which concludes the proof. ◀

Using this lemma we can prove the correctness of Rule 4.

► Lemma 5. Rule 4 is correct: if $T$ is a connected component in $F$ such that for all $X' \subseteq X$
which are independent in $G$ and satisfy $|X'| \le 2$ it holds that $\mathrm{Conf}_T(X') = 0$, then
$\alpha(G) = \alpha(G - T) + \alpha(T)$.

Proof. Assume the conditions in the statement of the lemma hold. It is trivial to see
that $\alpha(G) \le \alpha(G - T) + \alpha(T)$. To establish the lemma we only need to prove that
$\alpha(G) \ge \alpha(G - T) + \alpha(T)$, which we will do by showing that any independent set $I_{G-T}$ in
$G - T$ can be transformed to an independent set of size at least $|I_{G-T}| + \alpha(T)$ in $G$. So consider
such an independent set $I_{G-T}$, and let $X_I := I_{G-T} \cap X$ be the set of vertices which belong
to both $I_{G-T}$ and the feedback vertex set $X$. Suppose that $\alpha(T) > \alpha(T - N_G(X_I))$. Then
by Lemma 4 there is a subset $X' \subseteq X_I$ with $|X'| \le 2$ such that $\mathrm{Conf}_T(X') > 0$. Since $X_I$ is
an independent set, such a subset $X'$ would also be independent; but by the preconditions to
this lemma such a set $X'$ does not exist and therefore we must have $\alpha(T) = \alpha(T - N_G(X_I))$.

Now we show how to transform $I_{G-T}$ into an independent set for $G$ of the requested
size. Let $I_T$ be a MIS in $T - N_G(X_I)$, which has size $\alpha(T - N_G(X_I)) = \alpha(T)$. It is
easy to verify that $I_{G-T} \cup I_T$ must be an independent set in $G$ because vertices of $T$ are
only adjacent to vertices of $G - T$ which are contained in $X$. Hence the set $I_{G-T} \cup I_T$
is independent in $G$ and it has size $|I_{G-T}| + \alpha(T)$. Since this argument applies to any
independent set $I_{G-T}$ in graph $G - T$ it holds in particular for a MIS in $G - T$, which proves
that $\alpha(G) \ge \alpha(G - T) + \alpha(T)$ and thereby the claim.

We introduce the concept of blockability for the statement of the last two reduction rules. 

► Definition 6. We say that the pair $x, y \in V(G) \setminus X$ is X-blockable if $G$ contains an
independent set $X' \subseteq X$ of size $|X'| \le 2$ such that $\{x, y\} \subseteq N_G(X')$.

This can be interpreted as follows: any independent set in $G$ that contains $X'$ cannot
contain $x$ or $y$, so the pair $x, y$ is blocked from being in an independent set by choosing $X'$. It
follows directly from the definition that if $x, y$ is not X-blockable, then for any combination
of $u \in N_G(x) \cap X$ and $v \in N_G(y) \cap X$ we must have $\{u, v\} \in E(G)$.
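Definition 6 can be tested directly: a pair is X-blockable if a single feedback vertex is adjacent to both of its members, or if two non-adjacent feedback vertices cover one member each. A small sketch of this test, with an illustrative function name and edge representation:

```python
from itertools import combinations

def is_x_blockable(pair, x, edges):
    """Check whether the pair {p, q} of forest vertices is X-blockable."""
    p, q = pair
    edges = {frozenset(e) for e in edges}

    def adjacent(a, b):
        return frozenset([a, b]) in edges

    x = sorted(x)
    # A single feedback vertex adjacent to both p and q blocks the pair.
    if any(adjacent(y, p) and adjacent(y, q) for y in x):
        return True
    # Otherwise look for two non-adjacent feedback vertices covering p and q.
    for y, z in combinations(x, 2):
        if adjacent(y, z):
            continue
        if (adjacent(y, p) and adjacent(z, q)) or (adjacent(y, q) and adjacent(z, p)):
            return True
    return False
```

The consequence noted in the text is visible in the second branch: if the pair is not blockable, every feedback neighbor of one member must be adjacent to every feedback neighbor of the other.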

► Reduction Rule 5. If there are distinct vertices $u, v \in V(G) \setminus X$ which are adjacent in $G$
and are not X-blockable such that $\deg_F(u), \deg_F(v) \le 2$ then reduce the graph as follows.
Delete vertices $u, v$ and decrease $k$ by 1. If $u$ has a neighbor $t$ in $F$ which is not $v$, then make
all vertices of $N_G(v) \cap X$ adjacent to $t$. If $v$ has a neighbor $w$ in $F$ which is not $u$, then make
all vertices of $N_G(u) \cap X$ adjacent to $w$. If the vertices $t, w$ exist then they must be unique;
add the edge $\{t, w\}$ to the graph.
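The graph surgery performed by Rule 5 can be sketched as follows. The sketch assumes that the caller has already verified the preconditions (adjacency of $u$ and $v$, the degree bound in $F$, and non-blockability); the function names are illustrative, and `alpha_bruteforce` is only included so that the effect on the independence number can be checked on tiny examples:

```python
from itertools import combinations

def alpha_bruteforce(vertices, edges):
    """Independence number by exhaustive search (tiny graphs only)."""
    vertices = list(vertices)
    for size in range(len(vertices), -1, -1):
        for c in combinations(vertices, size):
            if not any(e <= set(c) for e in edges):
                return size

def apply_rule_5(vertices, edges, x, k, u, v):
    """Apply Rule 5 to adjacent forest vertices u, v (preconditions assumed)."""
    vertices, x = set(vertices), set(x)
    edges = {frozenset(e) for e in edges}

    def neighbors(a):
        return {b for e in edges if a in e for b in e if b != a}

    def other_forest_neighbors(a, b):
        return [c for c in neighbors(a) if c not in x and c != b]

    t_list = other_forest_neighbors(u, v)   # at most one vertex t
    w_list = other_forest_neighbors(v, u)   # at most one vertex w
    new_edges = {e for e in edges if u not in e and v not in e}
    for t in t_list:
        new_edges |= {frozenset([t, y]) for y in neighbors(v) & x}
    for w in w_list:
        new_edges |= {frozenset([w, y]) for y in neighbors(u) & x}
    if t_list and w_list:
        new_edges.add(frozenset([t_list[0], w_list[0]]))
    return vertices - {u, v}, new_edges, x, k - 1
```

On the path t, u, v, w with a single feedback vertex x adjacent to u, the rule removes u and v, connects x to w and t to w, and the independence number drops by exactly one, in line with the decrease of $k$.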
(a) Rule 5: Shrinking unblockable degree-2 paths in trees ($k' := k - 1$). (b) Rule 6: Removing unblockable pendants in trees ($k' := k - 2$).

Figure 2 Illustrations of two reduction rules. The original structure is shown on the left, and the
image on the right shows the structure after the reduction. Feedback vertices $X$ are drawn in the
bottom container, whereas the forest $G - X$ is visualized in the top container.



► Reduction Rule 6. If there are distinct vertices $t, u, v, w$ in $V(G) \setminus X$ such that $\deg_F(u) =
\deg_F(v) = 3$, $N_F(t) = \{u\}$, $N_F(w) = \{v\}$ and $\{u, v\} \in E(G)$ such that none of the pairs $\{u, t\}$,
$\{v, w\}$, $\{t, w\}$ are X-blockable, then reduce as follows. Let $\{p\} = N_F(u) \setminus \{t, v\}$ and let
$\{q\} = N_F(v) \setminus \{w, u\}$. Delete $\{t, u, v, w\}$ and their incident edges from $G$, decrease $k$ by 2,
make $p$ adjacent to all vertices of $N_G(t) \cap X$ and make $q$ adjacent to all vertices of $N_G(w) \cap X$.
See Figure 2 for an illustration of the final two reduction rules, which are meant to reduce the
sizes of the trees in the forest $F$. The correctness of these rules can be proven by an exchange
argument (Lemma 14, Lemma 15). Whereas Rule 4 deletes a tree $T$ from the forest $F$ when
we can derive that for every independent set in $G - T$ we can obtain an independent set
in $G$ which is $\alpha(T)$ vertices larger, these last reduction rules act locally within one tree, but
according to the same principle. Instead of working on an entire connected component of $F$,
they reduce subtrees $T' \subseteq F$ in situations where we can derive that every independent set
in $X$ can be augmented with $\alpha(T')$ vertices from $T'$. In Rule 5 we reduce the subtree on
vertices $\{u, v\}$ which has independence number 1, and in Rule 6 we reduce the subtree on
vertices $\{t, u, v, w\}$ with independence number 2. Connections between the vertices adjacent
to the reduced subtree are made to enforce that removal of the subtree does not affect the
types of interactions between the neighboring vertices.

When no reduction rules can be applied to an instance, we call it reduced. In reduced
instances the number of vertices in $F$ must be bounded by a function of $|X|$, which can be
proven using the following notion.

► Definition 7. Let $F'$ be a subforest of $F$, and define the number of active conflicts induced
on $F'$ by the feedback set $X$ as follows: $\mathrm{Active}_{F'}(X) := \sum_{X' \in \mathcal{X}} \mathrm{Conf}_{F'}(X')$ using the
abbreviation $\mathcal{X} := \{X' \subseteq X \mid |X'| \le 2 \wedge X' \text{ is independent in } G\}$.

The number of active conflicts induced on $F$ in a reduced instance is polynomially bounded
in $|X|$. For every $v \in X$ we have $\mathrm{Conf}_F(\{v\}) < |X|$ by Rule 2, and every pair of distinct
non-adjacent vertices $\{u, v\} \subseteq X$ satisfies $\mathrm{Conf}_F(\{u, v\}) < |X|$ by Rule 3. Hence, summing
over the at most $|X|$ singletons and $\binom{|X|}{2}$ pairs, for every reduced instance we have
$\mathrm{Active}_F(X) \le |X|^2 + \binom{|X|}{2}|X|$. A technical proof shows that in a reduced instance the
number of active conflicts induced on the forest $F$ is bounded from below by a function of $|V(F)|$.
By combining this with the bound on the number of active conflicts we can obtain the
following lemma, which is restated and proven in the appendix (Lemma 31).
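The upper bound on the number of active conflicts can be spelled out as a short calculation. It uses only the two facts stated above (in a reduced instance Rules 2 and 3 do not apply, so each relevant conflict value is an integer strictly below $|X|$) together with $\mathrm{Conf}_F(\emptyset) = 0$; the final comparison with $|X|^3$ is added here for orientation and is not quoted from the paper:

```latex
\begin{align*}
\mathrm{Active}_F(X)
  &= \sum_{v \in X} \mathrm{Conf}_F(\{v\})
   + \sum_{\substack{\{u,v\} \subseteq X,\ u \neq v \\ \{u,v\} \notin E(G)}}
     \mathrm{Conf}_F(\{u,v\}) \\
  &\le |X| \cdot (|X| - 1) + \binom{|X|}{2} (|X| - 1) \\
  &\le |X|^2 + \binom{|X|}{2} |X|
   \;=\; \frac{|X|^2 (|X| + 1)}{2}
   \;\le\; |X|^3 \qquad \text{for } |X| \ge 1 .
\end{align*}
```

So the number of active conflicts is at most cubic in $|X|$, which is where the cubic term in the kernel size originates.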



► Lemma 8. Let $(G, X, k)$ be a reduced instance of fvs-Independent Set with forest $F :=
G - X$. Then $|V(G)| \le |X| + 83|X|^3$.

By applying the reduction rules and the lemma which bounds the size of reduced instances
we can obtain the following kernelization results (see Section B.7):
► Theorem 9. fvs-Independent Set has a kernel with a cubic number of vertices: there is
a polynomial-time algorithm that transforms an instance $(G, X, k)$ into an equivalent instance
$(G', X', k')$ such that $|X'| \le |X|$, $k' \le k$, $|V(G')| - k' \le |V(G)| - k$, $|V(G')| \le 2(|V(G')| - k')$
and $|V(G')| \le |X| + 83|X|^3$.

► Corollary 10. fvs-Vertex Cover has a kernel with $\min(2k, |X| + 83|X|^3)$ vertices.

4 No Polynomial Kernel for fvs-Weighted Independent Set

In this section we show that the introduction of vertex weights makes the parameterized
Independent Set problem harder to kernelize, by proving that fvs-Weighted Independent
Set does not have a polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$. To establish this result, we
introduce a new parameterized problem called Paired Vector Agreement and show
that it does not have a polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$. We then finish the proof by
giving a polynomial-parameter transformation to fvs-Weighted Independent Set.

t-Paired Vector Agreement

Instance: A list $L$ consisting of $t$ pairs of vectors $(a^i, b^i)$ for $1 \le i \le t$ such that each
vector is an element of $\{0, 1, \#, ?\}^m$, and an integer $k \ge 0$.
Parameter: The number of pairs $t$.

Question: Is it possible to choose one vector from each pair, such that the chosen
vectors $S$ induce at most $k$ conflict positions? A position $1 \le j \le m$ in a vector is
a conflict position if some chosen vector $v \in S$ has $v_j = \#$ or if we have chosen
vectors $u, v \in S$ such that $u_j = 0$ and $v_j = 1$.
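The problem definition translates directly into an exhaustive solver that tries all $2^t$ ways of picking one vector per pair. The sketch below represents vectors as strings over the characters `0`, `1`, `#` and `?`; this encoding and the function names are assumptions made for illustration:

```python
from itertools import product

def conflict_positions(chosen):
    """Count positions with a '#' or with both a '0' and a '1' among `chosen`."""
    if not chosen:
        return 0
    count = 0
    for j in range(len(chosen[0])):
        column = {vec[j] for vec in chosen}
        if "#" in column or {"0", "1"} <= column:
            count += 1
    return count

def paired_vector_agreement(pairs, k):
    """Try all 2^t selections; True if some selection has at most k conflicts."""
    return any(conflict_positions(choice) <= k for choice in product(*pairs))
```

For the pairs `("0#", "#0")` and `("1?", "1#")`, the best selection is `"#0"` together with `"1?"`, which has a single conflict position, so the answer is yes for $k = 1$ and no for $k = 0$.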

The framework for proving that a parameterized problem does not have a polynomial
kernel unless $\mathrm{PH} = \Sigma_3^p$ requires us to establish that the corresponding classical problem
is NP-complete. Therefore we consider the classic variant Paired Vector Agreement,
which is defined similarly as t-Paired Vector Agreement with the exception that there is
no parameter. A reduction from Vertex Cover shows that Paired Vector Agreement
is NP-complete (appendix, Lemma 32). By exploiting the fact that t-Paired Vector
Agreement can be solved in $O(2^t p(m))$ time for some polynomial $p$ (by trying all possible
combinations of vectors), we can build an OR-composition algorithm for the paired agreement
problem using a bitmask selection strategy (appendix, Lemma 33); the techniques we use
here are similar to those employed by Dom et al. [14]. These two facts prove that Paired
Vector Agreement does not have a polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$. To relate these
results to fvs-Weighted Independent Set we use the following transformation.

► Lemma 11. There is a polynomial-parameter reduction from t-Paired Vector Agreement
to fvs-Weighted Independent Set.

Proof. Let $(L, t, m, k)$ be an instance of t-Paired Vector Agreement. We may assume
that $k < m$, since otherwise the answer to the instance is trivially yes. We show how to build
an equivalent instance $(G', w', X', k')$ of fvs-Weighted Independent Set in polynomial time such
that $|X'| = 2t$, which implies the existence of a polynomial-parameter reduction.

The graph $G'$ has $2(t + m)$ vertices, and is defined as follows. For each index $1 \le i \le t$
there is a pair of vertices $v_i^a, v_i^b$ which are connected by an edge, and have weight $2(t + m)$.
For each vector position $1 \le j \le m$ there are vertices $p_j^0, p_j^1$ which are connected by an
edge, and have weight 1. The vertices $v_i^a$ and $v_i^b$ correspond to the vectors $a^i, b^i$ of the
t-Paired Vector Agreement instance, and are connected to the position-vertices as
follows. Let $v$ be a vertex $v_i^a$ or $v_i^b$ corresponding to the vector $\mathrm{VEC}(v)$ which is $a^i$ or $b^i$,
respectively. For $1 \le i \le t$ vertex $v$ is adjacent in $G'$ to all $p_j^0$ for which vector $\mathrm{VEC}(v)$ has
a 0 at position $j$; it is also adjacent to all $p_j^1$ for which vector $\mathrm{VEC}(v)$ has a 1 at position $j$,
and finally vertex $v$ is adjacent to all $\{p_j^0, p_j^1\}$ for which vector $\mathrm{VEC}(v)$ has a $\ne$ at position $j$.
This concludes the definition of the structure of graph $G'$.
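
The construction can be sketched in a few lines of Python. This is only an illustration under stated assumptions: vectors are given as strings over '0', '1' and '#' (where '#' is a stand-in chosen here for the third symbol), a vector-vertex is joined to the 0-copy of a position where its vector has a 0, to the 1-copy where it has a 1, and to both copies where it has the third symbol. The function name and the tuple encoding of vertices are hypothetical.

```python
def build_reduction_graph(pairs, m):
    """Build (adjacency, weights, X) for t vector pairs of length m.

    pairs: list of (a, b) with a, b strings over '0', '1', '#'.
    Vertex names are tuples: ("v", i, "a"/"b") and ("p", j, 0/1).
    """
    t = len(pairs)
    adj, weight = {}, {}

    def add_vertex(v, w):
        adj[v] = set()
        weight[v] = w

    def add_edge(u, v):
        adj[u].add(v)
        adj[v].add(u)

    # Two weight-1 position vertices per position, joined by an edge.
    for j in range(m):
        add_vertex(("p", j, 0), 1)
        add_vertex(("p", j, 1), 1)
        add_edge(("p", j, 0), ("p", j, 1))

    # Two heavy vector vertices per pair, joined by an edge.
    for i, (a, b) in enumerate(pairs):
        va, vb = ("v", i, "a"), ("v", i, "b")
        add_vertex(va, 2 * (t + m))
        add_vertex(vb, 2 * (t + m))
        add_edge(va, vb)
        for v, vec in ((va, a), (vb, b)):
            for j, sym in enumerate(vec):
                if sym in ("0", "#"):
                    add_edge(v, ("p", j, 0))
                if sym in ("1", "#"):
                    add_edge(v, ("p", j, 1))

    X = {v for v in adj if v[0] == "v"}  # the vector vertices
    return adj, weight, X


pairs = [("01#", "110"), ("000", "1#1")]
adj, weight, X = build_reduction_graph(pairs, m=3)
print(len(adj), len(X))
```

Removing the vector vertices leaves only the edges between the two copies of each position, which is what makes them a feedback vertex set of size 2t.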

One may verify that a position vertex $p_j^x$ is adjacent to exactly one other position vertex, $p_j^{1-x}$,
which implies that the graph induced by the position vertices has maximum degree 1 and
is therefore a forest; this shows that the vector-vertices form a feedback vertex set for $G'$
and thus we define the feedback vertex set for our instance as $X' := \{v_i^a, v_i^b \mid 1 \le i \le t\}$
which has size exactly $2t$. We now ask for an independent set of total weight at least $k' :=
2t(t + m) + (m - k)$, which completes the description of instance $(G', w', X', k')$. It is easy
to see that this instance can be computed in polynomial time from the instance $(L, t, m, k)$.
The proof that these two instances are equivalent is not difficult, and has been moved to the
appendix (Lemma 34) due to space restrictions. ◄



Using standard kernelization lower-bound techniques (see [5, 13]) the combination of
Lemma 32, Lemma 33 (which are in the appendix) and Lemma 11 now allows us to derive
the following results.

► Theorem 12. The problems fvs-Weighted Independent Set and fvs-Weighted
Vertex Cover do not admit polynomial kernels unless $\mathrm{PH} = \Sigma_3^p$.

It is interesting to note that an instance $(G', w', X', k')$ of fvs-Weighted Independent Set resulting
from the polynomial-parameter transformation of Lemma 11 has a very restricted graph
structure: every connected component of the forest $G' - X'$ is a path on two vertices. Hence
our proof shows that even using the parameter "number of vertex deletions needed to turn
the graph into a disjoint union of $P_2$'s" (a structurally larger parameter than the FVS size)
there is no polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$.



5 Conclusion 



We have given upper and lower bounds on the size of kernels for the Vertex Cover and 
Independent Set problems using the parameter fvs(G). It would be very interesting 
to perform experiments with our new reduction rules to see whether they offer significant 
benefits over the existing Vertex Cover kernel on real-world instances. This result is one 
of the first examples of a polynomial kernel using a "refined" parameter which is structurally 
smaller than the standard parameterization. The contrasting result on the weighted problem 
shows that there is a rich structure waiting to be uncovered when studying kernelization using 
non-standard parameters. The kernel we have presented for fvs-Vertex Cover contains 
$O(|X|^3)$ vertices and can therefore be encoded in $O(|X|^6)$ bits using an adjacency matrix.
The results of Dell and Van Melkebeek [T^] imply that it is unlikely that there exists a kernel 
which can be encoded in $O(|X|^{2-\varepsilon})$ bits for any $\varepsilon > 0$. It might be possible to improve
the size of the kernel to a quadratic or even a linear number of vertices, by employing new 
reduction rules. The current reduction rules can be seen as analogs of the traditional "high 
degree" rule for the Vertex Cover problem, and it would be interesting to see whether it 
is possible to find analogs of crown reduction rules when using FVS(G) as the parameter. 

Although we have assumed throughout the paper that a feedback vertex set is supplied
with the input, we can drop this restriction by applying the known polynomial-time
2-approximation algorithm for FVS [3]. Observe that the reduction algorithm does not require
that the supplied set $X$ is a minimum feedback vertex set; the kernelization algorithm works
if $X$ is any feedback vertex set, and the size of the output instance depends on the size of
the FVS that is supplied. Hence if we compute a 2-approximate FVS and supply it to the
kernelization algorithm, the bound on the number of vertices in the output instance is only a
constant factor worse than when running the kernelization using a minimum FVS.

This paper has focused on the decision version of the Vertex Cover problem, but the
data reduction rules given here can also be translated to the optimization version to obtain
the following result: given a graph $G$ there is a polynomial-time algorithm that computes a
graph $G'$ and a non-negative integer $c$ such that $\mathrm{vc}(G) = \mathrm{vc}(G') + c$ with $|V(G')| \le 2\,\mathrm{vc}(G)$
and $|V(G')| \in O(\mathrm{fvs}(G)^3)$; and a vertex cover $S'$ for $G'$ can be transformed back into a
vertex cover of $G$ of size $|S'| + c$ in polynomial time.

The approach of studying Vertex Cover parameterized by $\mathrm{fvs}(G)$ fits into the broad
context of "parameterizing away from triviality" [28, 17], since the parameter $\mathrm{fvs}(G)$ measures
how many vertex-deletions are needed to reduce $G$ to a forest in which Vertex Cover can
be solved trivially in polynomial time. As there is a wide variety of restricted graph classes
for which Vertex Cover is in P, this opens up a multitude of possibilities for non-standard
parameterizations. For every graph class $\mathcal{G}$ which is closed under vertex deletion and for
which the Vertex Cover problem is in P, the Vertex Cover problem is in FPT when
parameterized by the size of a set $X$ such that $G - X \in \mathcal{G}$, assuming that $X$ is given as part
of the input. Recent work [3] in this direction shows that whenever $\mathcal{G}$ contains all cliques
the resulting parameterized problem does not have a polynomial kernel unless $\mathrm{PH} = \Sigma_3^p$.
Examples of such classes $\mathcal{G}$ are chordal graphs, interval graphs and other types of perfect
graphs. We conclude with two specific open problems. Is there a polynomial kernel using
the deletion distance from a bipartite graph as the parameter? Does the Vertex Cover
problem parameterized by the size of a minimum set $X$ such that $\mathrm{treewidth}(G - X) \le i$
have a polynomial kernel for every fixed $i$, or is there some value of $i$ for which this problem
does not have a polynomial kernel? The classic Vertex Cover kernelizations can be
interpreted as the case $i = 0$ whereas this paper supplies the result for $i = 1$. It appears that
many interesting insights are waiting to be discovered in this direction.
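
To make the FPT claim concrete for the forest case, the following Python sketch computes the independence number of a graph from a given feedback vertex set X by trying every independent subset of X and solving the remaining forest greedily. It is one standard way to obtain a running time of roughly 2^|X| times a polynomial, not the kernelization of this paper; the function names and the dictionary-of-sets graph encoding are illustrative assumptions. The vertex cover number follows as |V(G)| minus the returned value.

```python
from itertools import combinations


def forest_mis_size(adj):
    """Size of a maximum independent set in a forest (greedy on degree <= 1)."""
    adj = {v: set(ns) for v, ns in adj.items()}
    size = 0
    while adj:
        # A forest always has a vertex of degree at most 1; taking it is safe.
        v = next((u for u in adj if len(adj[u]) <= 1), None)
        assert v is not None, "input is not a forest"
        size += 1
        for u in {v} | adj[v]:
            for w in adj.pop(u):
                if w in adj:
                    adj[w].discard(u)
    return size


def alpha_with_modulator(adj, X):
    """Independence number of the graph, given X such that G - X is a forest."""
    Xs = set(X)
    best = 0
    for r in range(len(Xs) + 1):
        for chosen in combinations(sorted(Xs), r):
            cs = set(chosen)
            if any(adj[u] & cs for u in chosen):
                continue  # the guessed part of X is not independent
            blocked = set().union(*(adj[u] for u in chosen))
            rest = {v: adj[v] - Xs - blocked
                    for v in adj if v not in Xs and v not in blocked}
            best = max(best, r + forest_mis_size(rest))
    return best


edges = [("t", "u"), ("u", "v"), ("v", "w"),
         ("x1", "u"), ("x2", "v"), ("x1", "x2")]
adj = {}
for a, b in edges:
    adj.setdefault(a, set()).add(b)
    adj.setdefault(b, set()).add(a)

alpha = alpha_with_modulator(adj, {"x1", "x2"})
print(alpha, len(adj) - alpha)  # independence number, vertex cover number
```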
A Additional Preliminaries

The degree of a vertex $v$ in graph $G$ is denoted by $\deg_G(v)$. For $X \subseteq V(G)$ we denote by
$G[X] := (X, E(G) \cap (X \times X))$ the subgraph of $G$ that is induced by the vertices in $X$, and
for $Y \subseteq E(G)$ we denote by $G[Y] := (V(G), Y)$ the subgraph restricted to the edges in $Y$.
If $Y \subseteq E(G)$ is a subset of edges then $V(Y)$ denotes the endpoints of the edges in $Y$. In
particular, we will use $V(M)$ for a matching $M$ to denote the endpoints of the matched
edges. A vertex of degree 1 is called a leaf. If $v$ is a vertex in a tree and $v$ is not a leaf, then
it is an internal node of the tree. The leaf set of a graph $G$ is the set of degree-1 vertices,
denoted as $\mathrm{Leaves}(G) := \{v \in V(G) \mid \deg_G(v) = 1\}$. Define the set of high-degree vertices
of $G$ as $V_{\ge 3}(G) := \{v \in V(G) \mid \deg_G(v) \ge 3\}$.

► Proposition 2. Let $T$ be a tree. The following must hold.

1. $|E(T)| = |V(T)| - 1$.

2. $|\mathrm{Leaves}(T)| \ge |V_{\ge 3}(T)|$.

3. If $T$ does not contain degree-2 vertices then $|E(T)| \le 2\,|\mathrm{Leaves}(T)|$.
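
As a quick numeric illustration of the three claims, the Python sketch below counts edges, leaves and high-degree vertices of a small tree without degree-2 vertices. The helper name `tree_stats` and the example tree are chosen here for illustration only.

```python
def tree_stats(edges):
    """Return (#vertices, #edges, #leaves, #vertices of degree >= 3, has_degree_2)."""
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    leaves = sum(1 for d in deg.values() if d == 1)
    high = sum(1 for d in deg.values() if d >= 3)
    has_deg2 = any(d == 2 for d in deg.values())
    return len(deg), len(edges), leaves, high, has_deg2


# Two adjacent degree-3 vertices (0 and 3), each with two pendant leaves.
edges = [(0, 1), (0, 2), (0, 3), (3, 4), (3, 5)]
n, m, leaves, high, has_deg2 = tree_stats(edges)
print(n, m, leaves, high, has_deg2)
```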

► Lemma 13. Let $T$ be a tree and consider its independence decomposition $A, N, S$, and
let $M$ be the corresponding matching on $S$. If $e \in E(T) \setminus M$ then $\alpha(T) = \alpha(T / \{e\})$.

Proof. By Observation 1 we know that $\alpha(T) \le \alpha(T / \{e\})$. By (4) of Proposition 1 we know
that $\alpha(T) \ge \alpha(T / \{e\})$ and hence the two independence numbers must be equal.

B Omitted Proofs: Cubic Kernel for fvs-Independent Set
B.1 Correctness of Reduction Rules



► Lemma 14. Let $(G, X, k)$ with $F := G - X$ be an instance to which Rule 5 is applicable
at vertices $u, v$, and let $(G', X, k - 1)$ be the instance resulting from the reduction. Then it
holds that $\alpha(G) \ge k \Leftrightarrow \alpha(G') \ge k - 1$.

Proof. Assume the conditions in the statement of the lemma hold. We prove the two
directions separately.

($\Rightarrow$) Let $I_G$ be an independent set for graph $G$ of size at least $k$. We show how to obtain
an independent set $I_{G'}$ for graph $G'$ of size at least $|I_G| - 1 \ge k - 1$. Observe that no
independent set in $G$ can contain both $u$ and $v$ since they are adjacent. If $I_G$ does not contain
any of the vertices $\{u, v\}$ then we show how to obtain $I'_G$ which is at least as large and does
contain one of $\{u, v\}$; so assume $I_G$ avoids $u$ and $v$. Since the pair $u, v$ is not $X$-blockable by
the preconditions for the reduction rule, we know that there is at least one vertex among $u, v$
for which no neighbor in $X$ is chosen in $I_G$. Assume without loss of generality (by symmetry)
that this holds for $u$, such that $N_G(u) \cap X \cap I_G = \emptyset$. Since $v$ is not in $I_G$ by assumption,
the only neighbor of $u$ that can be in $I_G$ is its neighbor $t$ in $F$ unequal to $v$ (if such a $t$
exists; see Figure 2). If no such $t$ exists then $I'_G := I_G \cup \{u\}$ is a bigger independent set
in $G$; otherwise $I'_G := (I_G \setminus \{t\}) \cup \{u\}$ is an equally large independent set. So using this
replacement argument and symmetry, we may assume that $I_G$ is an independent set of size
at least $k$ for $G$ that contains $u$ but not $v$.

We now claim that $I_{G'} := I_G \setminus \{u\}$ is an independent set of size $\ge k - 1$ in $G'$. Since
it is easy to see that $I_{G'}$ has the desired size, it remains to show that it is an independent
set in $G'$. To establish this we need to show that the transformation to $G'$ does not add
any edges between vertices of $I_{G'}$. This is ensured because all edges that are added by the
transformation have at least one endpoint which is a neighbor of $u$: all added edges are
either incident on $t$ or on a vertex in $N_G(u) \cap X$. Hence for each added edge one endpoint $z$ is
adjacent to $u$, and since we assumed $u \in I_G$ this implies that $z$ cannot be in $I_{G'}$, since $I_{G'}$ is
a subset of the independent set $I_G$ in $G$ and having adjacent vertices $u$ and $z$ in $I_G$ would
violate independence. Therefore $I_{G'}$ is indeed an independent set of the required size in $G'$.

($\Leftarrow$) Let $I_{G'}$ be an independent set for graph $G'$ of size at least $k - 1$. We show how to
obtain an independent set $I_G$ for graph $G$ of size at least $|I_{G'}| + 1 \ge k$. The structure of $I_{G'}$
determines how to augment to a larger independent set $I_G$. From the structure of the reverse
transformation of $G'$ to $G$ it follows that $I_{G'}$ is an independent set in $G$; hence for each case
we will only show that the new vertex we add to the set will not violate independence in
graph $G$. We now do a case analysis based on whether or not the neighbors $t$ of $u$ and $w$
of $v$ are present.

■ If vertex $t$ exists and $t \in I_{G'}$, then define $I_G := I_{G'} \cup \{v\}$. To prove $I_G$ is an independent
set in $G$ we show that $N_G(v) \cap I_{G'} = \emptyset$ by consecutively proving that $\{u, w\} \cap I_{G'} = \emptyset$
and $N_G(v) \cap X \cap I_{G'} = \emptyset$, which together suffice to establish our claim because $N_G(v) =
\{u, w\} \cup (N_G(v) \cap X)$ (for as far as $w$ exists). Since $u \notin V(G')$ we trivially have $u \notin I_{G'}$,
and because the edge $\{t, w\}$ is added when forming $G'$ and $t \in I_{G'}$ by the case distinction
we have $w \notin I_{G'}$. To see that $N_G(v) \cap X \cap I_{G'} = \emptyset$ observe that $N_G(v) \cap X \subseteq N_{G'}(t)$ by
the construction of $G'$, and since $t \in I_{G'}$ and $I_{G'}$ is independent in $G'$ this proves the
claim and the correctness of this case.

■ If vertex $w$ exists and $w \in I_{G'}$, then define $I_G := I_{G'} \cup \{u\}$. The correctness argument is
symmetric to that of the previous case.

■ In the remaining case we know that $\{t, w\} \cap I_{G'} = \emptyset$. There must be some $z \in \{u, v\}$
such that $N_G(z) \cap X \cap I_{G'} = \emptyset$; because if there is no such $z$ then combining one
vertex from $N_G(u) \cap X \cap I_{G'}$ and one from $N_G(v) \cap X \cap I_{G'}$ gives a pair which proves
that $\{u, v\}$ is $X$-blockable in $G$, contradicting the precondition to the reduction rule. We
now assign $I_G := I_{G'} \cup \{z\}$. Since $N_G(z) \cap F \subseteq \{t, u, v, w\}$ and these vertices either do
not exist in $G'$ or are not in $I_{G'}$ by the case distinction, we know $\{t, u, v, w\} \cap I_{G'} = \emptyset$.
Since $N_G(z) \cap X \cap I_{G'} = \emptyset$ by our choice of $z$ this proves that the addition of $z$ to the
independent set does not violate independence, because $N_G(z) \subseteq (N_G(z) \cap X) \cup \{t, u, v, w\}$.

Since the case distinction is exhaustive this establishes the claim in this direction, which
concludes the proof.
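
The equivalence can be sanity-checked by brute force on a tiny instance. The Python sketch below is an illustration only: the transformation it applies (delete u and v, add the edge {t, w}, join t to the neighbors of v in X and w to the neighbors of u in X) is inferred from the edges that the proof above refers to, and is an assumption rather than the formal statement of Rule 5. The helper names are hypothetical.

```python
from itertools import combinations


def alpha(vertices, edges):
    """Independence number by exhaustive search (small graphs only)."""
    vertices = list(vertices)
    edge_set = {frozenset(e) for e in edges}
    for r in range(len(vertices), -1, -1):
        for cand in combinations(vertices, r):
            if all(frozenset(p) not in edge_set for p in combinations(cand, 2)):
                return r
    return 0


def apply_rule5_sketch(vertices, edges, X, t, u, v, w):
    """Assumed form of the reduction on the path t - u - v - w in the forest."""
    def nbrs(z):
        return {y for e in edges if z in e for y in e if y != z}

    new_edges = {frozenset(e) for e in edges if u not in e and v not in e}
    new_edges.add(frozenset((t, w)))
    new_edges |= {frozenset((t, x)) for x in nbrs(v) & X}
    new_edges |= {frozenset((w, x)) for x in nbrs(u) & X}
    return [z for z in vertices if z not in (u, v)], new_edges


V = ["t", "u", "v", "w", "x1", "x2"]
E = [("t", "u"), ("u", "v"), ("v", "w"),
     ("x1", "u"), ("x2", "v"), ("x1", "x2")]
X = {"x1", "x2"}  # x1 and x2 are adjacent, so {u, v} is not X-blockable

V2, E2 = apply_rule5_sketch(V, E, X, "t", "u", "v", "w")
print(alpha(V, E), alpha(V2, E2))
```

In this instance the reduced graph is a 4-cycle on t, w, x1, x2, so the independence number drops by exactly one, matching the decrease of k by 1.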



► Lemma 15. Let $(G, X, k)$ with $F := G - X$ be an instance to which Rule 6 is applicable
at vertices $t, u, v, w$, and let $(G', X, k - 2)$ be the instance resulting from the reduction. Then
it holds that $\alpha(G) \ge k \Leftrightarrow \alpha(G') \ge k - 2$.

Proof. Assume the conditions in the statement of the lemma hold. We prove the two 
directions separately. 

($\Rightarrow$) Let $I_G$ be an independent set for graph $G$ of size at least $k$. We show how to obtain
an independent set $I_{G'}$ for graph $G'$ of size at least $|I_G| - 2 \ge k - 2$. We first show that
without loss of generality we may assume that for one of the pairs $\{t, w\}$, $\{t, v\}$, $\{u, w\}$ both
vertices of the pair belong to $I_G$. To see this, suppose that $I_G$ avoids at least one vertex in
each pair. We then obtain an alternative independent set $I'_G$ which is at least as large, and
contains both vertices of at least one pair.

■ If $I_G \cap X \cap N_G(t) = \emptyset$ and $I_G \cap X \cap N_G(w) = \emptyset$ then define $I'_G := (I_G \setminus \{u, v, t, w\}) \cup \{t, w\}$
which is easily seen to be an independent set. Since no independent set can contain three
or more vertices from $\{u, v, t, w\}$ (because of the edges $\{u, t\}$ and $\{v, w\}$) we now have
$|I'_G| \ge |I_G|$.

■ If $I_G \cap X \cap N_G(t) \ne \emptyset$ then we must have $I_G \cap X \cap N_G(w) = \emptyset$; for if both sets are non-empty,
then taking one vertex from $I_G \cap X \cap N_G(t)$ and one vertex from $I_G \cap X \cap N_G(w)$ yields
a pair which shows that $\{t, w\}$ is $X$-blockable, which contradicts the preconditions to
Rule 6. Using the same argument we must have that $I_G \cap X \cap N_G(u) = \emptyset$, otherwise $\{t, u\}$
is $X$-blockable. Set $I'_G := (I_G \setminus \{p, u, t, v, w\}) \cup \{u, w\}$. The neighborhood conditions
show that no neighbors of $u, w$ in $X$ are contained in $I_G$ (and hence in $I'_G$), and because
we explicitly delete any neighbors that $u, w$ might have in $F$ when forming $I'_G$ we see
that $I'_G$ is also an independent set in $G$. If $I_G \cap X \cap N_G(t) \ne \emptyset$ as specified by the
precondition for this case, then we cannot have $t \in I_G$ because then $I_G$ would not be
independent. The edges $\{p, u\}$ and $\{v, w\}$ in $G$ show that of the set $\{p, u, v, w\}$ at most
two vertices are in an independent set; hence in this situation $I_G$ contains at most two
vertices from $\{p, u, t, v, w\}$ and therefore we have $|I'_G| \ge |I_G|$.

■ If $I_G \cap X \cap N_G(w) \ne \emptyset$ then we must have that $I_G \cap X \cap N_G(t) = I_G \cap X \cap N_G(v) = \emptyset$,
and we set $I'_G := (I_G \setminus \{q, u, t, v, w\}) \cup \{t, v\}$. The correctness argument is symmetric to
that of the previous case.

The argument above shows that we may assume without loss of generality that for one of the
pairs $\{t, w\}$, $\{t, v\}$, $\{u, w\}$ the independent set $I_G$ contains both vertices of the pair. Using
this assumption we show how to obtain an independent set $I_{G'}$ with $|I_{G'}| \ge |I_G| - 2$.

■ If $t, w \in I_G$ then define $I_{G'} := I_G \setminus \{t, w\}$. Since $t, w \in I_G$ implies that $u, v \notin I_G$ we
know that all vertices in $I_{G'}$ still exist in $G'$. It remains to show that they form an
independent set there. Because the reduction to $G'$ only adds edges incident on $p$ and $q$,
it suffices to show that for all edges incident on $p$ or $q$ which are added by the reduction
there is at least one endpoint not in $I_{G'}$. The transformation from $G$ to $G'$ adds edges
from $N_G(t) \cap X$ to $p$, and edges from $N_G(w) \cap X$ to $q$. But since $t, w \in I_G$ we know that
the independent set $I_G$ contains no vertices of $N_G(t) \cap X$ or $N_G(w) \cap X$, and hence the
defined set $I_{G'}$ is an independent set in $G'$.

■ If $t, v \in I_G$ then define $I_{G'} := I_G \setminus \{t, v\}$. All vertices in $I_{G'}$ must exist in $G'$ since
$u, w$ cannot be in $I_G$ because their neighbors $t, v$ are in $I_G$. The edges we add in the
transformation to $G'$ do not violate independence: because $t \in I_G$ we have $N_G(t) \cap I_G = \emptyset$,
and similarly because $v \in I_G$ we have $N_G(v) \cap I_G = \emptyset$ which in particular means $q \notin I_G$.
For all edges that we add, at least one endpoint is not in $I_G$ and therefore not in $I_{G'}$;
this proves that $I_{G'}$ is an independent set in $G'$.

■ If $u, w \in I_G$ then define $I_{G'} := I_G \setminus \{u, w\}$. The proof of correctness is symmetric to that
for the previous case.

Since one of these cases must apply, the listing is exhaustive and it concludes the proof of
this direction of the equivalence.

($\Leftarrow$) Let $I_{G'}$ be an independent set for graph $G'$ of size at least $k - 2$. We show how
to obtain an independent set $I_G$ for graph $G$ of size at least $|I_{G'}| + 2 \ge k$. The structure
of $I_{G'}$ determines how to augment to a larger independent set $I_G$ by adding two vertices
to $I_{G'}$. From the structure of the reverse transformation of $G'$ to $G$ it follows that $I_{G'}$ is an
independent set in $G$; hence for each case we will only show that the new vertices we add to
the set will not violate independence in graph $G$.

■ If $N_G(t) \cap X \cap I_{G'} = \emptyset$ and $N_G(w) \cap X \cap I_{G'} = \emptyset$ then assign $I_G := I_{G'} \cup \{t, w\}$. Since
vertices $t, w$ are clearly non-adjacent in $G$, and because the vertices in $I_{G'}$ form an
independent set in $G$ (as the transformation to $G$ does not add edges between vertices in
$I_{G'}$) we now have that $I_G$ is an independent set in $G$ of the required size.

■ If $N_G(t) \cap X \cap I_{G'} \ne \emptyset$ then we must have $N_G(w) \cap X \cap I_{G'} = \emptyset$, otherwise taking one
vertex from $N_G(t) \cap X \cap I_{G'}$ and one from $N_G(w) \cap X \cap I_{G'}$ would give a pair which shows
that $\{t, w\}$ is $X$-blockable in the original graph $G$, which contradicts the preconditions
for Rule 6. Similarly we must have $N_G(u) \cap X \cap I_{G'} = \emptyset$ by the assumption that $\{u, t\}$
is not $X$-blockable in $G$. Since vertex $p$ is adjacent in $G'$ to all vertices of $N_G(t) \cap X$,
we know that by independence of $I_{G'}$ if $N_G(t) \cap X \cap I_{G'} \ne \emptyset$ then $p \notin I_{G'}$. We now
set $I_G := I_{G'} \cup \{u, w\}$ which must form an independent set in $G$ because the established
conditions show that none of the vertices of $N_G(\{u, w\})$ can be in $I_{G'}$. It is easy to see
that $|I_G| \ge k$ in this case.

■ If $N_G(w) \cap X \cap I_{G'} \ne \emptyset$ then we must have $N_G(t) \cap X \cap I_{G'} = N_G(v) \cap X \cap I_{G'} = \emptyset$ by
the non-blockability of $\{w, t\}$ and $\{w, v\}$. We assign $I_G := I_{G'} \cup \{t, v\}$. The correctness
proof is symmetric to that of the previous case.

Since the case distinction is exhaustive this establishes the claim in this direction, which
concludes the proof. ◄



B.2 Structure of Reduced Instances 

The structure of reduced instances is captured by the following observations. 

► Observation 3. Let $(G, X, k)$ be a reduced instance with the corresponding forest $F := G - X$.

■ If $v$ is a leaf of the forest $F$, then vertex $v$ is adjacent in $G$ to some vertex in $X$. For if $v$
is not adjacent to a vertex in $X$ then $\deg_G(v) = \deg_F(v) = 1$, but degree-1 vertices are
removed by Rule 1. This implies that a leaf $v$ must be $X$-blockable in a reduced instance,
or formally: the pair $\{v, v\}$ must be $X$-blockable.

■ If the forest $F$ contains two adjacent vertices $u, v$ such that $\deg_F(u), \deg_F(v) \le 2$ then
$\{u, v\}$ is $X$-blockable by Rule 5.

■ If there are distinct vertices $t, u, v, w$ in $F$ such that $\deg_F(u) = \deg_F(v) = 3$, $N_F(t) = \{u\}$,
$N_F(w) = \{v\}$ and $\{u, v\} \in E(F)$ then at least one of the pairs $\{u, t\}$, $\{v, w\}$, $\{t, w\}$ is
$X$-blockable by Rule 6.

■ For every tree $T$ in the forest $F$ there exists an independent subset of feedback vertices
$X' \subseteq X$ with $|X'| \le 2$ such that $\mathrm{Conf}_T(X') > 0$ by Rule 4.



► Observation 4. For every $v \in X$ we have $\mathrm{Conf}_F(\{v\}) < |X|$ by Rule 2 and every pair of
distinct non-adjacent vertices $\{u, v\} \subseteq X$ satisfies $\mathrm{Conf}_F(\{u, v\}) < |X|$ by Rule 3. Hence
for every reduced instance we have $\mathrm{Active}_F(X) \le |X|^2 + \binom{|X|}{2}|X|$.



B.3 Outline of the proof of Lemma 8

The next subsections lead up to the proof of Lemma 8, which is re-stated and proven in this
appendix as Lemma 31. As the proof is fairly substantial, we first sketch an outline of the
proof. The size bound is obtained by showing that if $T$ is a tree in the forest of a reduced
instance, then $\mathrm{Active}_T(X) \ge c \cdot |V(T)|$ for a fixed constant $c > 0$. This claim is proven in two steps. Any tree must
have at least a constant fraction of its vertices on degree-2 paths of length at least 8, or it
must have at least a constant fraction of its vertices as leaves. We show that for both these
quantities, the number of active conflicts induced by the set $X$ on the tree $T$ is at least a
constant fraction of these quantities, i.e. at least a constant fraction of the total length of the
degree-2 paths which have size $\ge 8$ (proven in Section B.4), and at least a constant fraction
of the total number of leaves (proven in Section B.5). Section B.6 then shows how these
lemmas can be combined to give the complete proof of Lemma 8.

B.4 Conflicts from degree-2 paths in the forest



In this section we show that the total number of active conflicts induced on a tree $T$ in the
forest $F$ must be at least a constant fraction of the total length of the paths containing at
least 8 vertices of degree 2 in the tree $T$. To establish this result we make extensive use of
the independence decomposition of a forest as described in Proposition 1. We show how this
decomposition behaves when removing certain types of edges and vertices, and we also show
that under certain conditions the decomposition allows us to obtain a direct bound on the
decrease in the independence number of a tree that results from deleting specified vertices
from that tree.

► Lemma 16. Let $H$ be a forest, and consider the independence decomposition $A, N, S$ of
the vertex set of $H$ with the corresponding matching $M$ on the vertices $S$. Let $\{u, v\} \in M$
and consider the subforest $H' := H - \{u, v\}$ obtained by deleting $u, v$ and their incident edges.
Then $\alpha(H) = \alpha(H') + 1$ and the independence decomposition $A', N', S'$ of $H'$ can be obtained
from $A, N, S$ by restriction to the vertex set of $H'$.

Proof. Assume the conditions in the statement of the lemma hold. We will first show the
relationship between the independence numbers of $H$ and $H'$. Because a MIS for $H$ can
use at most one vertex of $u, v$ since they are adjacent, we can obtain an independent set
in $H'$ by removing at most 1 vertex from an independent set in $H$. Hence $\alpha(H') \ge \alpha(H) - 1$.
From Observation 1 we know that $\alpha(H') \le \alpha(H)$. Now suppose that $\alpha(H') = \alpha(H)$.
Then the corresponding MIS in $H'$ is also a MIS in $H$ that contains neither $u$ nor $v$,
contradicting Proposition 1 since the edge $\{u, v\}$ is contained in $M$. Therefore we must
have $\alpha(H') < \alpha(H)$ and combining this with the earlier derived fact that $\alpha(H') \ge \alpha(H) - 1$
we find that $\alpha(H') = \alpha(H) - 1$.

We now show that the independence decomposition $A', N', S'$ of forest $H'$ is simply the
decomposition $A, N, S$ restricted to the vertex set of $H'$. By Proposition 1 we know that the
independence decomposition of a forest is uniquely determined by adjacency relations. We
will show that the suggested restriction $A', N', S'$ satisfies these adjacency relations and must
therefore be the independence decomposition of $H'$. Property (I) must still hold because we
have deleted a matched edge and its endpoints, which means that the matching restricted
to the remaining vertices is still a perfect matching on $H[S']$. Since we do not give vertices
in $A$ extra neighbors, we do not violate (II). And finally (III) cannot be violated because we
only delete vertices in $S$, which means that all vertices in $N'$ still have the same neighbors
in $A'$ as they have in $A$. This concludes the proof.
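
The statement can be checked by exhaustive search on a small forest. The Python sketch below assumes the reading of the decomposition that the proofs in this appendix rely on: A holds the vertices contained in every maximum independent set, N those contained in none, and S those contained in some but not all. The helper names are illustrative, and the brute-force enumeration is only meant for tiny inputs.

```python
from itertools import combinations


def max_independent_sets(vertices, edges):
    """All maximum independent sets, by exhaustive search."""
    vertices = sorted(vertices)
    edge_set = {frozenset(e) for e in edges}
    for r in range(len(vertices), -1, -1):
        found = [set(c) for c in combinations(vertices, r)
                 if all(frozenset(p) not in edge_set for p in combinations(c, 2))]
        if found:
            return found


def decomposition(vertices, edges):
    """Return (alpha, A, N, S) under the all / none / some reading."""
    mis = max_independent_sets(vertices, edges)
    A = set.intersection(*mis)
    union = set.union(*mis)
    return len(mis[0]), A, set(vertices) - union, union - A


def delete_vertices(vertices, edges, gone):
    gone = set(gone)
    return ([v for v in vertices if v not in gone],
            [e for e in edges if not set(e) & gone])


# Forest: a path a-b-c-d (all four vertices in S) and a path e-f-g.
V = list("abcdefg")
E = [("a", "b"), ("b", "c"), ("c", "d"), ("e", "f"), ("f", "g")]
alpha, A, N, S = decomposition(V, E)

# {a, b} is an edge of the perfect matching on H[S]; delete its endpoints.
V2, E2 = delete_vertices(V, E, {"a", "b"})
alpha2, A2, N2, S2 = decomposition(V2, E2)
print(alpha, alpha2, sorted(S), sorted(S2))
```

Here the independence number drops from 4 to 3, and the decomposition of the smaller forest is the old one restricted to the surviving vertices.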

Using the previous lemma as an induction step, we obtain the following.

► Lemma 17. Let $H$ be a forest, and consider the independence decomposition $A, N, S$ of the
vertex set of $H$ with the corresponding matching $M$ on the vertices $S$. Let $Y \subseteq M \subseteq E(H)$
consist of a subset of the edges matched under $M$, and consider the graph $H' := H - V(Y)$
obtained from $H$ by deleting all vertices which are an endpoint of an edge in $Y$. Then $\alpha(H) =
\alpha(H') + |Y|$ and the independence decomposition $A', N', S'$ of $H'$ can be obtained from $A, N, S$
by restriction to the vertex set of $H'$.

Proof. Assume the conditions in the statement of the lemma hold. We will prove the
statement by induction on $|Y|$. For $Y = \emptyset$ the statement is clearly true since then $H' = H$
and $|Y| = 0$. Now consider the case that $|Y| \ge 1$, and assume the statement has been proven
for all smaller values of $|Y|$. Consider some edge $\{u, v\} \in Y$. Since $\{u, v\}$ is an edge matched
under $M$ (by definition of $Y$) it follows from Lemma 16 that $\alpha(H) = \alpha(H - \{u, v\}) + 1$. By the
second property of the lemma we know that the independence decomposition of $H - \{u, v\}$
is just the decomposition of $H$ restricted to the vertex set of $H - \{u, v\}$ and that all edges in $Y' :=
Y \setminus \{\{u, v\}\}$ are matched under the restricted matching. Hence we may apply induction to the graph
$H - \{u, v\}$ with the edge set $Y'$ to establish that $\alpha(H - \{u, v\}) = \alpha(H - \{u, v\} - V(Y')) + |Y'|$,
which shows that $\alpha(H) = \alpha(H - \{u, v\} - V(Y')) + |Y'| + 1 = \alpha(H - V(Y)) + |Y|$. This
proves the claim.

► Lemma 18. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be a
tree in $F$. If there is a subset of edges $Z \subseteq E(T)$ such that the subgraph $T' := T / Z$ satisfies
$\alpha(T) = \alpha(T')$ and the matching $M'$ from the independence decomposition of $T'$ contains $q$
edges whose endpoints are $X$-blockable in $G$, then $\mathrm{Active}_T(X) \ge q$.

Proof. Assume the conditions in the statement of the lemma hold. Consider the matching $M_B$
containing the edges in $M'$ whose endpoints are $X$-blockable in $G$. By the definition of
$X$-blockability we know that for every edge $\{u, v\}$ in $M_B$ there is a set $X' \subseteq X$ which is
independent in $G$ such that $|X'| \le 2$ and $\{u, v\} \subseteq N_G(X')$. For such a set $X'$ define the
edges in $M_B$ hit by $X'$ as $H_{X'}(M_B) := \{\{u, v\} \mid \{u, v\} \in M_B \wedge \{u, v\} \subseteq N_G(X')\}$. We will
show that for every such independent set $X'$ we have $\mathrm{Conf}_T(X') \ge |H_{X'}(M_B)|$.

Consider an independent set $X' \subseteq X$ in the graph $G$ such that $|X'| \le 2$, and the
subgraph $T - V(H_{X'}(M_B))$ obtained from $T$ by removing all endpoints of edges in $H_{X'}(M_B)$.
It follows that $T - N_G(X') \subseteq T - V(H_{X'}(M_B)) \subseteq T$ by the definition of $H_{X'}(M_B)$. We can
now bound the independence number of $T - N_G(X')$ through the following derivation.

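\begin{align*}
\alpha(T - N_G(X')) &\le \alpha((T - N_G(X')) / Z) && \text{By Observation 1.} \\
&\le \alpha((T / Z) - N_G(X')) && \text{By commutativity.} \\
&= \alpha(T' - N_G(X')) && \text{By definition of } T'. \\
&\le \alpha(T' - V(H_{X'}(M_B))) && \text{By Observation 1, since } V(H_{X'}(M_B)) \subseteq N_G(X'). \\
&= \alpha(T') - |H_{X'}(M_B)| && \text{By Lemma 17.} \\
&= \alpha(T) - |H_{X'}(M_B)| && \text{Since } \alpha(T) = \alpha(T') \text{ by assumption.}
\end{align*}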



Observe that we are justified in applying Lemma 17 where $T'$ plays the role of the forest $H$
and the set $H_{X'}(M_B)$ plays the role of the set of edges $Y$, because all edges in $H_{X'}(M_B)$ are
contained in the matching $M'$ from the independence decomposition of $T'$. By the definition
of conflicts the derivation above shows that $\mathrm{Conf}_T(X') \ge |H_{X'}(M_B)|$. By the definition of
active conflicts, the independent set $X' \subseteq X$ contributes to the number of active conflicts
since $X'$ is either a singleton or a pair of non-adjacent vertices from $X$. This implies that
every independent set $X' \subseteq X$ in graph $G$ with $|X'| \le 2$ contributes at least $|H_{X'}(M_B)|$ to
the number of active conflicts on $T$. Since there are $q$ edges in the matching $M'$ that are
$X$-blockable (by assumption in the statement of the lemma), each of these $q$ edges occurs in
the set $H_{X'}(M_B)$ for at least one valid independent set $X'$ and hence the total number of
conflicts for all such sets $X'$ must sum to at least $q$. By the definition of active conflicts this
shows that $\mathrm{Active}_T(X) \ge q$, which concludes the proof. ◄

► Lemma 19. Let $H$ be a forest, and consider the independence decomposition $A, N, S$ of
the vertex set of $H$ with the corresponding matching $M$ on the vertices $S$. If $P$ is a simple
path in $H$ on vertices $v_1, v_2, \ldots, v_l$ such that $\deg_H(v_i) \le 2$ for all $1 \le i \le l$ then either all
vertices $v_i$ are in the set $S$, or the vertices on $P$ are alternatingly contained in $A$ and in $N$.

Proof. Assume the conditions in the statement of the lemma hold. We will give a proof by
contradiction using the adjacency relations that define the independence decomposition, as
given in Proposition 1.

We first show that if $P$ contains one vertex in $S$, then all vertices on $P$ must be in $S$. So
assume for a contradiction that $P$ contains a vertex from $S$ and a vertex not in $S$; then there
are adjacent vertices $x, y$ on $P$ such that $x \in S$ and $y \notin S$. Since the sets $A, N, S$ partition
the vertex set of $H$, the vertex $y$ must be in $A$ or in $N$. But by (II) of Proposition 1 no vertex
in $A$ can be adjacent to a vertex in $S$; since we assumed that vertices $x, y$ are adjacent and
$x \in S$ this shows that $y \notin A$. Hence $y$ must be contained in $N$. But by (III) of Proposition 1
all vertices in $N$ must be adjacent to at least two vertices in $A$. Because $\deg_H(y) \le 2$ by the
definition of the path $P$, and because the neighbor $x$ of $y$ is in $S$, it follows that $y$ cannot
have two neighbors in $A$. Therefore $y$ cannot be in $N$ either, which is a contradiction to the
fact that $A, N, S$ partition $V(H)$. This concludes this part of the proof; if $P$ contains one
vertex in $S$, then all vertices on $P$ must be in $S$.

For the second part of the proof, assume that all vertices in $P$ are from sets $A$ or $N$, but
that these do not alternate; then there must be two successive vertices $v_i, v_{i+1}$ on the path
which are contained in the same subset of the partition. By the same argument as before, the
adjacent vertices $v_i, v_{i+1}$ cannot both be in the set $A$ by (II). But the vertices $v_i, v_{i+1}$ cannot both be
in the set $N$ because vertices in $N$ must have at least two neighbors in $A$. Since $v_i, v_{i+1}$ have
degree at most 2 in $H$ by definition of $P$ this cannot hold if the adjacent vertices $v_i, v_{i+1}$ are
in $N$. Hence we again arrive at a contradiction, which proves the claim.
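
The dichotomy is easy to observe on small paths. The Python sketch below labels every vertex by exhaustive search, assuming that A, N and S consist of the vertices lying in all, none, and some-but-not-all maximum independent sets respectively (the reading used in the proofs of this appendix), and then reports which of the two patterns a given path follows. The function names are illustrative.

```python
from itertools import combinations


def classify(vertices, edges):
    """Label each vertex 'A', 'N' or 'S' via brute-force enumeration of all MIS."""
    vertices = sorted(vertices)
    edge_set = {frozenset(e) for e in edges}
    for r in range(len(vertices), -1, -1):
        mis = [set(c) for c in combinations(vertices, r)
               if all(frozenset(p) not in edge_set for p in combinations(c, 2))]
        if mis:
            break
    labels = {}
    for v in vertices:
        count = sum(v in s for s in mis)
        labels[v] = "A" if count == len(mis) else "N" if count == 0 else "S"
    return labels


def path_pattern(labels, path):
    """Report which alternative of the lemma the path satisfies."""
    seq = [labels[v] for v in path]
    if all(x == "S" for x in seq):
        return "all in S"
    if "S" not in seq and all(x != y for x, y in zip(seq, seq[1:])):
        return "alternating A/N"
    return "neither"


p5 = list("abcde")
p5_edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")]
p4 = list("abcd")
p4_edges = [("a", "b"), ("b", "c"), ("c", "d")]

print(path_pattern(classify(p5, p5_edges), p5))
print(path_pattern(classify(p4, p4_edges), p4))
```

The path on five vertices has a unique maximum independent set and alternates between A and N, while the path on four vertices has three maximum independent sets and lies entirely in S.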

► Lemma 20. Let $H$ be a forest, and consider the independence decomposition $A, N, S$ of the
vertex set of $H$ with the corresponding matching $M$ on the vertices $S$. Let $P$ be a simple path
in $H$ on vertices $v_1, v_2, \ldots, v_l$ such that $\deg_H(v_i) \le 2$ for all $1 \le i \le l$. Consider an edge
$\{v_i, v_{i+1}\}$, let $H' := H / \{\{v_i, v_{i+1}\}\}$ and let $A', N', S'$ be the independence decomposition
of $H'$. If one of the following conditions holds:

1. $v_i \in A$ and $v_{i+1} \in N$,

2. $v_i, v_{i+1} \in S$ and $\{v_i, v_{i+1}\} \notin M$,

then $\alpha(H) = \alpha(H')$ and all vertices $v_{i+1}, \ldots, v_l$ are in the set $S'$.



Proof. Assume the conditions in the statement of the lemma hold. We prove the two claims
successively.

(1) Since the endpoints of the edge $\{v_i, v_{i+1}\}$ are not in $S$, this edge is not contained in
the matching $M$ and by Lemma 13 we must have $\alpha(H) = \alpha(H')$. We will show that $v_{i+1} \in S'$
by proving that there is a MIS for $H'$ which contains $v_{i+1}$, and that there is also a MIS
for $H'$ which avoids $v_{i+1}$.

By the assumption that $v_{i+1} \in N$ a MIS for $H$ does not contain $v_{i+1}$ by Proposition 1 and
is also an independent set in $H'$; since $\alpha(H') = \alpha(H)$ this shows there is a MIS for $H'$
avoiding $v_{i+1}$. Since $\deg_H(v_{i+1}) \le 2$ by assumption, we must have $\deg_{H'}(v_{i+1}) \le 1$ because
we deleted the edge to its old neighbor $v_i$. By Observation 2 this means there is a MIS
for $H'$ that contains $v_{i+1}$. By the properties of the sets $A', N', S'$ stated in Proposition 1 the
fact that $v_{i+1}$ is contained in some but not all maximum independent sets of $H'$ proves that
$v_{i+1} \in S'$.

Now consider the path $P' := (v_{i+1}, \ldots, v_l)$ in $H'$. Since the first vertex on this path $v_{i+1}$
is in $S'$, Lemma 19 shows that all vertices on this path must be in $S'$. This proves the claim.

(2) Now consider the second case. Since it is explicitly assumed that $\{v_i, v_{i+1}\} \notin M$ we
must have $\alpha(H) = \alpha(H')$ by Lemma 13. By Lemma 19 we derive that since $v_i \in S$ we
must have $v_j \in S$ for all $1 \le j \le l$. To prove that the vertices $v_{i+1}, \ldots, v_l$ are in $S'$ we
will show that the partitions $A, N, S$ and $A', N', S'$ are actually identical. By Proposition 1
the independence decomposition is uniquely determined by adjacency relations. It is not
hard to verify that since the sets $A, N, S$ satisfy the adjacency relations on graph $H$, the
same partition must satisfy the adjacency relations on graph $H'$ because we only remove
a non-matching edge between vertices in $S$. Since the decomposition is unique, this shows
that the partition $A, N, S$ is also the independence decomposition of $H'$. Hence we have
$v_{i+1} \in S'$. By considering the subpath $P' := \{v_{i+1}, \ldots, v_l\}$ and invoking Lemma 19 we now
find that all vertices on $P'$ must be in the set $S'$, which proves the last part of the claim. ◀
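The independence decomposition used in this proof can be explored on toy instances. The following Python sketch is an illustration only: it assumes, in line with how the proof uses Proposition 1, that $A$ holds the vertices in every MIS, $N$ the vertices in no MIS, and $S$ the vertices in some but not all of them. The function names are hypothetical and the brute-force enumeration is only meant for very small graphs.

```python
from itertools import combinations


def maximum_independent_sets(vertices, edges):
    """Return all maximum independent sets of a small graph by brute force."""
    edge_set = {frozenset(e) for e in edges}
    vertices = list(vertices)
    # Try sizes from largest to smallest; size 0 always succeeds.
    for size in range(len(vertices), -1, -1):
        found = [
            set(cand)
            for cand in combinations(vertices, size)
            if not any(frozenset(p) in edge_set for p in combinations(cand, 2))
        ]
        if found:
            return found


def independence_decomposition(vertices, edges):
    """Split the vertices into (A, N, S): in all, in no, in some-but-not-all MIS."""
    mis = maximum_independent_sets(vertices, edges)
    in_all = set.intersection(*mis)
    in_some = set.union(*mis)
    return in_all, set(vertices) - in_some, in_some - in_all


# Path 1 - 2 - 3: the unique MIS is {1, 3}.
print(independence_decomposition([1, 2, 3], [(1, 2), (2, 3)]))
# After deleting the edge {1, 2}, vertices 2 and 3 move into S.
print(independence_decomposition([1, 2, 3], [(2, 3)]))
```

In this toy example the edge $\{1, 2\}$ joins a vertex of $A$ to a vertex of $N$; deleting it leaves the independence number at 2 and places the remaining path vertices in $S'$, in the spirit of case (1).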

► Lemma 21. Let $H$ be a forest, and consider the independence decomposition $A, N, S$ of
the vertex set of $H$ with the corresponding matching $M$ on the vertices $S$. Let $\mathcal{P}$ be a set of
mutually vertex-disjoint simple paths in $H$ such that every $P \in \mathcal{P}$ contains at least 8 vertices,
each of degree at most 2 in $H$. Then there is a set of edges $Z$ such that $\alpha(H) = \alpha(H/Z)$ and
such that the matching $M_Z$ from the independence decomposition of $H/Z$ contains at least
$\frac{1}{3} \sum_{P \in \mathcal{P}} |V(P)|$ edges for which both endpoints lie on the same path in $\mathcal{P}$.

Proof. Assume the conditions in the statement of the lemma hold. We will prove the
statement by induction on $|\mathcal{P}|$. If $\mathcal{P} = \emptyset$ then the claim is trivially true. So assume that the
claim holds for all sets of paths $\mathcal{P}'$ with $|\mathcal{P}'| < |\mathcal{P}|$. Consider a tree $T$ in $H$ containing at
least one path in $\mathcal{P}$; by the assumption that $\mathcal{P} \neq \emptyset$ such a tree $T$ must exist. Choose an
arbitrary leaf $r \in V(T)$ of the tree as the root. For all paths $P \in \mathcal{P}$ which are contained
in the tree $T$, we define their upper endpoint to be the endpoint of the path closest to $r$,
and the lower endpoint to be the one with largest distance to $r$. Now take the path $P \in \mathcal{P}$ for
which the upper endpoint has maximum distance from $r$, taken over all paths from $\mathcal{P}$ that
lie in $T$ and breaking ties arbitrarily. Label the vertices on the path $P$ by $v_1, v_2, \ldots, v_l$ by
increasing distance from the root $r$. By Lemma 19 we know that either all vertices on $P$ are
contained in $S$, or the vertices of $P$ are alternately contained in $A$ and $N$. Based on the
local structure we select an edge $e$ to go into the set $Z$.

■ If all vertices of $P$ are contained in $S$, then consider the smallest $i$ such that the edge
$\{v_i, v_{i+1}\}$ is not contained in the matching $M$; because $M$ is a perfect matching in $T[S]$
we must have $i = 1$ or $i = 2$. Set $e := \{v_i, v_{i+1}\}$.

■ If the vertices of $P$ are alternately contained in $A$ and $N$, then consider the smallest $i$
such that $v_i \in A$. Because the vertices alternate we have $i = 1$ or $i = 2$. Set $e := \{v_i, v_{i+1}\}$.

Consider the forest $H/\{e\}$ and its independence decomposition $A', N', S'$ with matching $M'$.
By Lemma 20 we know that $\alpha(H) = \alpha(H/\{e\})$ and that the vertices $v_{i+1}, \ldots, v_l$ must be
in $S'$. Consider the connected component $T_P$ of $H/\{e\}$ that contains the vertex $v_{i+1}$. By the
choice of the path $P$ the component $T_P$ cannot contain any vertices of paths $P^* \in \mathcal{P}$ with
$P^* \neq P$; such a path would have an upper endpoint which is further from the root $r$ than
the upper endpoint of $P$. Now look at the subpath $P' := \{v_{i+1}, \ldots, v_l\}$ in $H/\{e\}$. Since all
vertices of $P'$ are in the set $S'$ of the independence decomposition of $H/\{e\}$, and because $M'$
is a perfect matching on the vertices of $S'$, the edge $\{v_{i+1}, v_{i+2}\}$ must be in $M'$ because
$v_{i+1}$ has degree 1 in $H/\{e\}$ and cannot be matched to any other vertex. This shows that
for all $j$ in the range $0, 1, \ldots, \lfloor (l - i)/2 \rfloor - 1$, the edge $\{v_{i+2j+1}, v_{i+2j+2}\}$ lies on $P'$ and is
contained in $M'$, and hence there are at least $\lfloor (l - i)/2 \rfloor$ such edges. Because $i \le 2$ and $l \ge 8$
this shows that the component $T_P$ contains at least $l/3$ edges which are contained in the
matching $M'$, and for which both endpoints lie on the same path in $\mathcal{P}$.
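The final counting step, that $\lfloor (l - i)/2 \rfloor \ge l/3$ whenever $i \in \{1, 2\}$ and $l \ge 8$, can be sanity-checked numerically. The snippet below is only a finite check over a range of path lengths, not a proof; the helper names are hypothetical.

```python
def matched_edges_on_subpath(l, i):
    """Number of edges {v_(i+2j+1), v_(i+2j+2)} that fit on the subpath v_(i+1)..v_l."""
    return (l - i) // 2


def bound_holds(l, i):
    """Check floor((l - i) / 2) >= l / 3 using integer arithmetic only."""
    return 3 * matched_edges_on_subpath(l, i) >= l


# The bound holds for every tested path length from 8 upwards ...
print(all(bound_holds(l, i) for l in range(8, 1000) for i in (1, 2)))
# ... but already fails for a path on 7 vertices with i = 2.
print(bound_holds(7, 2))
```

The failure at $l = 7$ indicates why the lemma asks for paths with at least 8 vertices.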

To finish the proof we apply induction, and show that we can add the edge $e$ that was
defined above to the edge set resulting from the induction step. So apply the induction
hypothesis on the forest $H' := H - V(T_P)$ with the set of paths $\mathcal{P}' := \mathcal{P} \setminus \{P\}$ to find a set
of edges $Z' \subseteq E(H')$ such that $\alpha(H') = \alpha(H'/Z')$, and such that the matching $M_{Z'}$ from the
independence decomposition of $H'/Z'$ contains at least $\frac{1}{3} \sum_{P' \in \mathcal{P}'} |V(P')|$ edges for which both
endpoints lie on the same path in $\mathcal{P}'$. Define $Z := Z' \cup \{e\}$; we will prove that this set $Z$
satisfies the requirements. Let us first verify that $\alpha(H) = \alpha(H/Z)$.

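\begin{align*}
\alpha(H) &= \alpha(H/\{e\}) && \text{by Lemma 20} \\
&= \alpha(H') + \alpha(T_P) && \text{since } H/\{e\} \text{ is the disjoint union of } H' \text{ and } T_P \\
&= \alpha(H'/Z') + \alpha(T_P) && \text{by the induction hypothesis} \\
&= \alpha(H/Z) && \text{since } H/Z \text{ is the disjoint union of } H'/Z' \text{ and } T_P.
\end{align*}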

Now we prove that the matching $M_Z$ of the independence decomposition of $H/Z$ must
contain sufficiently many edges for which both endpoints belong to the same path in $\mathcal{P}$. Since
$H/Z$ is just the disjoint union of $H'/Z'$ and $T_P$, all edges for which this condition holds
in $H'/Z'$ still satisfy this condition for $H/Z$. By induction it follows that there are at least
$\frac{1}{3} \sum_{P' \in \mathcal{P}'} |V(P')|$ such edges. We saw earlier that the component $T_P$ contains at least $l/3$
such edges, where $l$ is the length of the path $P$ we used for the induction step. Hence there
are at least $\frac{1}{3} \sum_{P' \in \mathcal{P}'} |V(P')| + l/3 = \frac{1}{3} \sum_{P \in \mathcal{P}} |V(P)|$ edges in $H/Z$ which are contained in
the matching $M_Z$ of the forest $H/Z$ and for which both endpoints lie on the same path in $\mathcal{P}$.
This concludes the proof.

To state the next lemma we need an additional definition. 

► Definition 22. Let $T$ be a tree, and let the set $\mathcal{P}(T)$ contain all maximal simple paths
in $T$ containing only vertices which have degree at most 2 in $T$. Let $\mathcal{P}^{\ge 8}(T)$ be the subset
obtained from $\mathcal{P}(T)$ by removing paths which contain fewer than 8 vertices.

Note that the paths in $\mathcal{P}(T)$ are mutually vertex-disjoint by definition.
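As an illustration of Definition 22, the paths in $\mathcal{P}(T)$ can be read off as the connected components of the subgraph of $T$ induced by the vertices of degree at most 2 (in a tree each such component is a path). The Python sketch below assumes the tree is given as a dictionary mapping each vertex to its set of neighbours; the function names and the `spider` test tree are hypothetical and not part of the paper.

```python
def low_degree_paths(adj, min_vertices=1):
    """Vertex sets of the maximal paths of T that use only vertices of degree <= 2.

    adj maps each vertex of the tree T to the set of its neighbours.
    Paths with fewer than min_vertices vertices are dropped.
    """
    low = {v for v, nbrs in adj.items() if len(nbrs) <= 2}
    seen, paths = set(), []
    for start in low:
        if start in seen:
            continue
        # Collect the component of `start` inside the low-degree vertices.
        component, stack = set(), [start]
        seen.add(start)
        while stack:
            v = stack.pop()
            component.add(v)
            for w in adj[v]:
                if w in low and w not in seen:
                    seen.add(w)
                    stack.append(w)
        if len(component) >= min_vertices:
            paths.append(component)
    return paths


def spider(arm_lengths):
    """Test tree: a centre "c" with one pendant path per entry of arm_lengths."""
    adj = {"c": set()}
    for arm, length in enumerate(arm_lengths):
        prev = "c"
        for j in range(length):
            v = (arm, j)
            adj.setdefault(v, set())
            adj[prev].add(v)
            adj[v].add(prev)
            prev = v
    return adj


T = spider([8, 2, 1])
print(sorted(len(p) for p in low_degree_paths(T)))      # all of P(T)
print(len(low_degree_paths(T, min_vertices=8)))          # size of P^{>=8}(T)
```

Calling `low_degree_paths(T, min_vertices=8)` corresponds to $\mathcal{P}^{\ge 8}(T)$ under these assumptions.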

► Lemma 23. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be a
tree in $F$. Then $\mathrm{Active}_T(X) \ge \frac{1}{3} \sum_{P \in \mathcal{P}^{\ge 8}(T)} |V(P)|$.

Proof. Assume the conditions in the statement of the lemma hold. From Observation 3 we
know that all pairs $\{u, v\}$ of adjacent vertices on paths in $\mathcal{P}(T)$ are $X$-blockable, because
these vertices all have degree at most 2 in $T$ and hence in $F$. By applying Lemma 21
using the tree $T$ as the forest $H$, and the set $\mathcal{P}^{\ge 8}(T)$ as the set of paths, we obtain a
set of edges $Z \subseteq E(T)$ such that $\alpha(T) = \alpha(T/Z)$ and such that $T/Z$ contains at least
$\frac{1}{3} \sum_{P \in \mathcal{P}^{\ge 8}(T)} |V(P)|$ edges that are in the independence matching of $T/Z$ and for which
both endpoints are on the same path in $\mathcal{P}(T)$. Since all these edges must be $X$-blockable it
follows from Lemma 18 that $\mathrm{Active}_T(X) \ge \frac{1}{3} \sum_{P \in \mathcal{P}^{\ge 8}(T)} |V(P)|$.



B.5 Conflicts from leaves in the forest 

In this section we show that if T is a tree in the forest F associated with a reduced instance, 
then the number of active conflicts induced on T must be proportional to the number of 
leaves in the tree. To prove this claim we first introduce five types of substructures in a tree, 
and show that they contribute to the number of conflicts. We then prove that the number of
conflicts we can get from such structures in a reduced tree is proportional to the number of 
leaves of the tree, by giving a greedy algorithm to find such structures and proving that this 
algorithm must be able to find many such structures. 

► Definition 24. We define five types of conflict structures for a tree T in the forest F of a 
reduced instance. For each such structure we also define its yield, which is a lower bound 
on the number of conflicts resulting from the presence of that structure. Given a subset 
$X' \subseteq X$ which is an independent set in $G$, we give conditions under which $X'$ hits a conflict
structure, and the value for which the structure is hit by $X'$. If $X'$ hits a conflict structure $S$
for value $q$, then this means that if $X'$ is used in an independent set this will cause the
number of vertices from $S$ which can be used in the independent set to decrease by at least $q$,
compared to the situation where no vertex from $X$ is used in the independent set. Figure 3
shows examples of the structures.



1. A conflict structure $S$ of Type (1) consists of a parent vertex $p \in V(T)$ and all its
adjacent leaves $N_T(p) \cap \mathrm{Leaves}(T)$, under the condition that $|N_T(p) \cap \mathrm{Leaves}(T)| \ge 2$.
We define the yield of this structure to be $|N_T(p) \cap \mathrm{Leaves}(T)| - 1$. Observe that all
vertices $\mathrm{Leaves}(T)$ are $X$-blockable by Observation 3. A subset $X' \subseteq X$ hits $S$ for
the value $\min(|N_T(p) \cap \mathrm{Leaves}(T) \cap N_G(X')|, |N_T(p) \cap \mathrm{Leaves}(T)| - 1)$ if it is true
that $N_T(p) \cap \mathrm{Leaves}(T) \cap N_G(X') \neq \emptyset$; otherwise $X'$ does not hit $S$.
2. A conflict structure $S$ of Type (2) consists of a leaf vertex $l \in \mathrm{Leaves}(T)$ and its single
neighbor $\{p\} = N_T(l)$ such that the pair $\{l, p\}$ is $X$-blockable. The yield of this structure
is 1. A subset $X'$ hits $S$ for value 1 if $\{l, p\} \subseteq N_G(X')$.



3. A conflict structure $S$ of Type (3) consists of vertices $t, u, v, w$ in $V(T)$ such that $\deg_T(u) =
\deg_T(v) = 3$, $N_T(t) = \{u\}$, $N_T(w) = \{v\}$ and $\{u, v\} \in E(T)$. By Observation 3 at least
one of the pairs $\{u, t\}$, $\{v, w\}$, $\{t, w\}$ is $X$-blockable in a reduced instance. The yield
of this structure is 1. A subset $X'$ hits the structure for value 1 if $\{u, t\} \subseteq N_G(X')$ or
$\{v, w\} \subseteq N_G(X')$ or $\{t, w\} \subseteq N_G(X')$.



4. A conflict structure $S$ of Type (4) consists of vertices $t, u, v, w, x$ in $V(T)$ such that $\deg_T(u) =
\deg_T(w) = 3$, $N_T(v) = \{u, w\}$, $N_T(t) = \{u\}$ and $N_T(x) = \{w\}$. The yield of this structure
is 1. Observe that the condition $\deg_T(t) = \deg_T(x) = 1$ implies that both $t$ and $x$ are
$X$-blockable by Observation 3. A subset $X'$ hits the structure for value 1 if $t \in N_G(X')$
or $x \in N_G(X')$.



5. A conflict structure $S$ of Type (5) consists of vertices $l, p, u, v$ in $V(T)$ such that $\deg_T(p) =
3$, $\deg_T(u) = \deg_T(v) = 2$ with $N_T(l) = \{p\}$ and $N_T(u) = \{p, v\}$. The yield of this
structure is 1. Observe that the condition $\deg_T(u) = \deg_T(v) = 2$ implies that the
pair $\{u, v\}$ is $X$-blockable by Observation 3. A subset $X'$ hits the structure for value 1 if
$\{u, v\} \subseteq N_G(X')$.

If $S$ is a conflict structure then $\mathrm{Yield}(S)$ represents the yield value of that structure.
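To make the hit value of a Type (1) structure concrete, the following Python sketch evaluates the formula from item 1 of Definition 24. It assumes the leaf set $N_T(p) \cap \mathrm{Leaves}(T)$ and the blocked set $N_G(X')$ are already available as Python sets; the function names are hypothetical.

```python
def type1_yield(leaf_children):
    """Yield of a Type (1) structure: number of leaves adjacent to p, minus one."""
    return len(set(leaf_children)) - 1


def type1_hit_value(leaf_children, blocked):
    """Value for which X' hits a Type (1) structure, or None if X' does not hit it.

    leaf_children -- the set N_T(p) & Leaves(T)
    blocked       -- the set N_G(X')
    """
    leaf_children = set(leaf_children)
    hit = leaf_children & set(blocked)
    if not hit:
        return None
    return min(len(hit), len(leaf_children) - 1)


leaves_of_p = {"a", "b", "c"}
print(type1_yield(leaves_of_p))                         # yield of the structure
print(type1_hit_value(leaves_of_p, {"a"}))              # one leaf blocked
print(type1_hit_value(leaves_of_p, {"a", "b", "c"}))    # capped at the yield
print(type1_hit_value(leaves_of_p, {"z"}))              # not hit
```

The cap at $|N_T(p) \cap \mathrm{Leaves}(T)| - 1$ reflects that once every leaf is blocked the parent $p$ itself can still be used, so at most that many vertices are lost.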



► Lemma 25. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be
a tree in $F$. If $\mathcal{S}$ is a set of vertex-disjoint conflict structures in $T$ then $\mathrm{Active}_T(X) \ge
\sum_{S \in \mathcal{S}} \mathrm{Yield}(S)$.

Proof. Assume the conditions in the statement of the lemma hold. Consider some set $X' \subseteq X$
with $|X'| \le 2$ which is independent in $G$. It follows from the definition of the conflict structures
and Observation 3 that the sum over all such $X'$ of the values for which they hit the structures
in $\mathcal{S}$ is at least the sum of the yields of these structures. We prove the main claim of the
lemma by showing that for each set $X'$ satisfying the conditions stated above, the number
of conflicts induced on $T$ by the set $X'$ is at least the total value for which it hits conflict
structures in $\mathcal{S}$. Since the number of active conflicts that $X$ induces on $T$ is defined as the
sum over all $X'$ of the number of conflicts induced by $X'$, this will give a proof.

So consider a set $X' \subseteq X$ with $|X'| \le 2$ such that $X'$ is independent in $G$. Let $q$ be
the total value for which $X'$ hits conflict structures in $\mathcal{S}$. We prove that $\mathrm{Conf}_T(X') \ge q$,
which is equivalent to showing that $\alpha(T) - \alpha(T - N_G(X')) \ge q$ and hence that $\alpha(T) \ge
\alpha(T - N_G(X')) + q$. Consider an independent set in $T - N_G(X')$ and call this set $I'$. We
show how to construct an independent set $I$ for tree $T$ such that $|I| \ge |I'| + q$. We initialize
the independent set $I$ as a copy of $I'$, and then show that for each conflict structure $S$ hit
by $X'$ we can increase the size of $I$ by the value for which $X'$ hits $S$. Let $s$ be the number
of conflict structures in $\mathcal{S}$ which are hit by $X'$, and number these conflict structures as
$S_1, \ldots, S_s$. Let $V(S_i)$ for $1 \le i \le s$ denote the vertices of the conflict structure $S_i$. We show
how to construct a series of independent sets $I_0, I_1, \ldots, I_s$ starting with $I_0 := I'$ such that
$I_i \cap N_G(X') \cap V(S_j) = \emptyset$ for all $i + 1 \le j \le s$, and such that $|I_{i+1}| - |I_i|$ is at least the value
for which $X'$ hits structure $S_{i+1}$. Since $I_0 = I'$ is an independent set in $T - N_G(X')$ it is easy
to see that $I_0$ satisfies the intersection requirement. Since the size of the independent set
under construction will increase at each step by the value for which the corresponding conflict
structure is hit by $X'$, this ensures we will terminate with a sufficiently large independent
set $I_s$. Recall that for distinct conflict structures $S_i, S_j$ we have $V(S_i) \cap V(S_j) = \emptyset$ by the
assumption that the conflict structures of $\mathcal{S}$ are vertex-disjoint. Given $I_i$ for $0 \le i < s$ we
show how to construct $I_{i+1}$, based on the type of structure $S_{i+1}$.

■ Figure 3 Illustrations of conflict structures which are hit by subsets $X' \subseteq X$. The left image
shows the original situation, whereas the right image shows the tree $T' := T \setminus N_G(X')$.
(c) Structure of Type (3) hit at vertices $\{t, w\}$ by set $X' = \{a\}$ for value 1.
(d) Structure of Type (4) hit at vertex $t$ by set $X' = \{a\}$ for value 1.
(e) Structure of Type (5) hit at vertices $\{u, v\}$ by set $X' = \{a, b\}$ for value 1.



1. If $S_{i+1}$ is a structure of Type (1) then we determine the contents of the set $I_{i+1}$ based on
whether or not $p \in I_i$. If $p \in I_i$ then assign $I_{i+1} := (I_i \setminus \{p\}) \cup (N_T(p) \cap \mathrm{Leaves}(T))$. It
is easy to see that $I_{i+1}$ is independent in this case since we explicitly remove the only
neighbor $p$ of all the leaves that we add to the set. Since $p \in I_i$ implies that none of
the vertices in $N_T(p) \cap \mathrm{Leaves}(T)$ are in $I_i$, the size of the independent set increases by
$|N_T(p) \cap \mathrm{Leaves}(T)| - 1$ which is at least as much as the value for which $X'$ hits $S_{i+1}$, by
definition of the value. If $p \notin I_i$ then assign $I_{i+1} := I_i \cup (N_T(p) \cap \mathrm{Leaves}(T) \cap N_G(X'))$.
By the structure of the graph around $p$ it is easy to see that $I_{i+1}$ is independent, since
the only neighbor $p$ of the added vertices is not in $I_i$ by the case distinction. By the
invariant of the construction procedure we know that $I_i$ does not contain any vertices
of $N_T(p) \cap \mathrm{Leaves}(T) \cap N_G(X')$ (because $N_T(p) \cap \mathrm{Leaves}(T) \subseteq V(S_{i+1})$) and therefore
$|I_{i+1}| \ge |I_i| + |N_T(p) \cap \mathrm{Leaves}(T) \cap N_G(X')|$ which shows that the size increases by at
least as much as the value for which $X'$ hits $S_{i+1}$.



2. If $S_{i+1}$ is a structure of Type (2) then assign $I_{i+1} := I_i \cup \{l\}$. Since $\{l, p\} \cap I_i = \emptyset$ by the
invariant, as $\{l, p\} = V(S_{i+1}) \subseteq N_G(X')$, and because $l$ is a leaf in $T$, the new set $I_{i+1}$
is also an independent set in $T$ and its size is 1 larger than the size of $I_i$; hence the size
increase matches the value for which $X'$ hits $S_{i+1}$.



3. If $S_{i+1}$ is a structure of Type (3) then assign $I_{i+1} := (I_i \setminus \{u, v\}) \cup \{t, w\}$. Because the only
neighbors of $t, w$ in $T$ are $u, v$ and these vertices are explicitly removed from $I_{i+1}$ it is easily
seen that $I_{i+1}$ is independent in $T$. It remains to show that $|I_{i+1}| \ge |I_i| + 1$. To establish
this we will show that $I_i$ contains at most 1 vertex from $\{t, u, v, w\}$. By the definition of $X'$
hitting the conflict structure $S_{i+1}$ of Type (3) we know that $\{u, t\} \subseteq N_G(X')$ or $\{v, w\} \subseteq
N_G(X')$ or $\{t, w\} \subseteq N_G(X')$. But in each case the set $\{t, u, v, w\} \setminus N_G(X')$ contains at
most two vertices, and if it contains two vertices then these are connected by an edge.
Hence the independent set $I_i$, which does not contain any vertices of $N_G(X') \cap V(S_{i+1})$
(by the invariant), can contain at most 1 vertex from $\{t, u, v, w\}$, and thus $|I_{i+1}| \ge |I_i| + 1$.



4. If $S_{i+1}$ is a structure of Type (4) then assign $I_{i+1} := (I_i \setminus \{t, u, v, w, x\}) \cup \{t, v, x\}$. It is
easy to see that $I_{i+1}$ is an independent set in $T$; we will prove that $|I_{i+1}| \ge |I_i| + 1$ by
showing that $I_i$ contains at most 2 vertices from $\{t, u, v, w, x\}$. By the definition of $X'$
hitting the conflict structure $S_{i+1}$ we know that $t \in N_G(X')$ or $x \in N_G(X')$. Assume
without loss of generality (by symmetry) that $t \in N_G(X')$. By the invariant this implies
that $t \notin I_i$, and because $G$ contains the edges $\{u, v\}$ and $\{w, x\}$ we know that from each
of these edges at most one endpoint is in $I_i$. This shows that $I_i$ contains at most 2 vertices
from $\{t, u, v, w, x\}$, as required.



5. If $S_{i+1}$ is a structure of Type (5) then assign $I_{i+1} := (I_i \setminus \{l, p, u, v\}) \cup \{l, u\}$. Once again it
is easy to see that $I_{i+1}$ is independent, and we show that $I_i$ contains at most 1 vertex from
the set $\{l, p, u, v\}$. By the invariant we know that $\{u, v\} \cap I_i = \emptyset$ since $\{u, v\} \subseteq N_G(X') \cap V(S_{i+1})$.
By the existence of the edge $\{l, p\}$ the set $I_i$ contains at most 1 vertex from $\{l, p\}$, which
proves that $|I_{i+1}| \ge |I_i| + 1$.

The fact that these induction steps preserve the invariant that $I_i \cap N_G(X') \cap V(S_j) = \emptyset$ for
all $i + 1 \le j \le s$ can be seen from the fact that the vertices which are added to $I_{i+1}$ by some
update step are a subset of $V(S_{i+1})$. Since the conflict structures in $\mathcal{S}$ are vertex-disjoint
this shows that vertices of some set $V(S_j) \cap N_G(X')$ can only be part of sets $I_{j'}$ for $j' \ge j$,
which implies that the invariant is maintained.

The incremental process shows that $|I_s| \ge |I'| + q$, where $q$ is the total value for which $X'$
hits the conflict structures in $\mathcal{S}$ and where $I_s$ is an independent set in $T$. As we have shown
that we can transform any independent set for $T - N_G(X')$ into an independent set for $T$
containing at least $q$ more vertices, it follows that $\alpha(T) \ge \alpha(T - N_G(X')) + q$. By the
definition of conflicts this implies that $\mathrm{Conf}_T(X') \ge q$. Because the sum of these hit values over
all subsets $X'$ satisfying the requirements must be at least as large as the total yield of the
conflict structures in $\mathcal{S}$, this proves by the definition of active conflicts that
$\mathrm{Active}_T(X) \ge \sum_{S \in \mathcal{S}} \mathrm{Yield}(S)$. ◀
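The quantity $\mathrm{Conf}_T(X') = \alpha(T) - \alpha(T - N_G(X'))$ used throughout this proof is easy to compute on small trees. The sketch below is an illustration, not the paper's procedure: it uses the standard greedy rule for forests (repeatedly take a vertex of degree at most 1 and delete it together with its neighbour) to compute $\alpha$, and takes the blocked set $N_G(X')$ as a given input. All function names are hypothetical.

```python
def alpha_forest(adj):
    """Independence number of a forest given as {vertex: set_of_neighbours}."""
    adj = {v: set(nbrs) for v, nbrs in adj.items()}  # work on a private copy
    size = 0
    while adj:
        # A non-empty forest always has a vertex of degree at most 1.
        v = next(u for u, nbrs in adj.items() if len(nbrs) <= 1)
        size += 1
        removed = {v} | adj[v]
        for u in removed:
            for w in adj[u]:
                if w not in removed:
                    adj[w].discard(u)
        for u in removed:
            del adj[u]
    return size


def conflicts(tree_adj, blocked):
    """Conf_T(X') = alpha(T) - alpha(T - N_G(X')), with blocked standing for N_G(X')."""
    keep = set(tree_adj) - set(blocked)
    remaining = {v: tree_adj[v] & keep for v in keep}
    return alpha_forest(tree_adj) - alpha_forest(remaining)


# A Type (1) structure: parent p with three adjacent leaves.
star = {"p": {"a", "b", "c"}, "a": {"p"}, "b": {"p"}, "c": {"p"}}
print(conflicts(star, {"a"}))             # one blocked leaf
print(conflicts(star, {"a", "b", "c"}))   # all leaves blocked
```

On this star the conflict counts coincide with the Type (1) hit values $\min(1, 2)$ and $\min(3, 2)$, matching the inequality $\mathrm{Conf}_T(X') \ge q$ with equality.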

► Lemma 26. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be a
tree in $F$ such that $|V(T)| > 1$. There is a set $\mathcal{S}$ of vertex-disjoint conflict structures in $T$
such that $\sum_{S \in \mathcal{S}} \mathrm{Yield}(S) \ge \frac{1}{5} |\mathrm{Leaves}(T)|$.

Proof. We use a proof by construction which finds a set of conflict structures. The procedure
grows a subtree $T' \subseteq T$ and set $\mathcal{S}$ incrementally, and during each augmentation step of the
tree we enforce an incremental inequality which shows that the number of leaves in $T$ which
are contained in $T'$ is proportional to the total yield of the conflict structures found so far.
This proof strategy is inspired by the method of "amortized analysis by keeping track of
dead leaves" which is used in extremal graph theory [23].

We use the following properties in the analysis. We say that the vertices $\mathrm{Leaves}(T') \cap
\mathrm{Leaves}(T)$ are the closed branches of the tree $T'$. The vertices $\mathrm{Leaves}(T') \setminus \mathrm{Leaves}(T)$ are
the open branches of $T'$. Observe that when we have grown the tree $T'$ until it equals $T$,
then for $T' = T$ the number of open branches is 0 and the number of closed branches of $T'$ is
equal to the number of leaves in $T$. For the subtree $T' \subseteq T$ let $O_{T'}$ denote the number of
open branches of $T'$, and let $C_{T'}$ denote the number of closed branches. We also construct a
set $\mathcal{S}$ of conflict structures, and enforce the invariant that at each stage of the construction
for all conflict structures $S \in \mathcal{S}$ we have $V(S) \subseteq V(T') \setminus (\mathrm{Leaves}(T') \setminus \mathrm{Leaves}(T))$, i.e. the
vertices we use in conflict structures are contained in $T'$ and are not open branches of $T'$.
Define the total current yield $Y_{\mathcal{S}}$ as $\sum_{S \in \mathcal{S}} \mathrm{Yield}(S)$. We ensure that our augmentation
operations respect the following incremental inequality:

$$2\Delta O_{T'} + 5\Delta Y_{\mathcal{S}} \ge \Delta C_{T'}. \tag{1}$$

The $\Delta$ values in the incremental inequality refer to the difference between the new status
of $T'$ and $\mathcal{S}$ compared to their old status; if $T'$ has 5 open branches at a given moment,
and we perform an augmentation after which it has 4 open branches then $\Delta O_{T'} = -1$ for
that step. We define the augmentations to the tree $T'$ by adding vertices to it; it will be
understood implicitly that the subtree $T'$ we are considering is the subtree of $T$ induced by
the vertices we have added to $T'$.
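The bookkeeping of open and closed branches can be made concrete with a small Python sketch. It assumes the tree $T$ is given as a dictionary from vertices to neighbour sets and that $T'$ is the subtree induced by a given vertex set with at least two vertices; the function names and the example tree are hypothetical.

```python
def leaves(adj):
    """Vertices of degree exactly 1."""
    return {v for v, nbrs in adj.items() if len(nbrs) == 1}


def branches(tree_adj, sub_vertices):
    """Return (closed, open) branches of the subtree T' of T induced by sub_vertices."""
    sub_vertices = set(sub_vertices)
    sub_adj = {v: tree_adj[v] & sub_vertices for v in sub_vertices}
    closed = leaves(sub_adj) & leaves(tree_adj)   # Leaves(T') that are leaves of T
    open_ = leaves(sub_adj) - leaves(tree_adj)    # Leaves(T') that are internal in T
    return closed, open_


# p has leaf neighbours a, b, c and non-leaf neighbours d, e.
T = {
    "p": {"a", "b", "c", "d", "e"},
    "a": {"p"}, "b": {"p"}, "c": {"p"},
    "d": {"p", "d1"}, "e": {"p", "e1"},
    "d1": {"d"}, "e1": {"e"},
}
closed, open_ = branches(T, {"p"} | T["p"])
print(sorted(closed), sorted(open_))
```

With $T'$ induced by the closed neighbourhood of `p`, the leaves `a`, `b`, `c` are closed branches and `d`, `e` are open branches; once $T' = T$ the set of open branches is empty.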

We will show that the subtree $T'$ and the set $\mathcal{S}$ can be initialized and grown such that
each augmentation satisfies this incremental inequality, until $T' = T$. At that stage we will
have $O_{T'} = 0$, and by summing the incremental inequality over all augmentation steps we
then find $Y_{\mathcal{S}} \ge C_{T'}/5 = |\mathrm{Leaves}(T)|/5$ which will prove the claim. Hence to establish the
claim all that remains is to give the initialization and augmentation operations. Figure 4
illustrates some augmentations.
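The summation argument can be replayed on a concrete sequence of steps. The sketch below uses the per-step values that can be read off from the caption of Figure 4; the triple for panel (d) is an assumption taken from the values stated for the leaf case of Operation 4, since that part of the caption is damaged. The variable and function names are hypothetical.

```python
# (delta_open, delta_yield, delta_closed) per step, following Figure 4.
steps = [
    (2, 2, 3),    # (b) initialization by Operation 1
    (-1, 1, 1),   # (d) assumed from Operation 4, leaf case
    (0, 0, 0),    # (e) Operation 4, non-leaf case
    (0, 1, 2),    # (f) first case of Operation 11
    (-1, 1, 1),   # (g) Operation 4, leaf case
]


def respects_inequality(d_open, d_yield, d_closed):
    """Incremental inequality (1): 2 * dO + 5 * dY >= dC."""
    return 2 * d_open + 5 * d_yield >= d_closed


total_open = sum(s[0] for s in steps)
total_yield = sum(s[1] for s in steps)
total_closed = sum(s[2] for s in steps)

print(all(respects_inequality(*s) for s in steps))
print(total_open, total_yield, total_closed)
# With no open branches left, summing (1) gives 5 * Y >= C.
print(5 * total_yield >= total_closed)
```

Since the open-branch changes cancel out once $T' = T$, the summed inequality reduces to $5 Y_{\mathcal{S}} \ge C_{T'}$, which is the bound claimed in the lemma.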

We ensure that for all vertices $v$ which are internal to the tree $T'$ we have $N_T(v) = N_{T'}(v)$;
i.e. if we make a vertex internal in $T'$ then we add all its neighbors to $T'$, and not just a
selected few.

Initialization. There are three possible ways to initialize the tree T', depending on the 
structure of T. 

► Operation 1. If there is a vertex $p \in V(T)$ such that $|N_T(p) \cap \mathrm{Leaves}(T)| \ge 2$ then
initialize $T'$ as $N_T[p]$ and add the Type (1) conflict structure $S$ consisting of the vertices
$\{p\} \cup (N_T(p) \cap \mathrm{Leaves}(T))$ to the set $\mathcal{S}$. The number of closed branches of $T'$ is equal to
$|N_T(p) \cap \mathrm{Leaves}(T)| \ge 2$, the number of open branches is equal to $|N_T(p) \setminus \mathrm{Leaves}(T)|$ and
$\mathrm{Yield}(S) = Y_{\mathcal{S}} = |N_T(p) \cap \mathrm{Leaves}(T)| - 1 \ge \frac{1}{2}|N_T(p) \cap \mathrm{Leaves}(T)|$. It is straightforward
to verify that this combination of values satisfies the incremental inequality. We leave
this verification as an exercise to the reader, and do not explicitly re-state that all further
augmentation operations satisfy the incremental inequality.

► Operation 2. If there is no vertex $p$ as described by Operation 1 then every vertex in $T$
is adjacent to at most 1 leaf. If $T$ contains a leaf $l$ with $N_T(l) = \{p\}$ such that the pair
$\{l, p\}$ is $X$-blockable, then initialize $T' := N_T[p]$ and add the Type (2) conflict structure $S$
on vertices $\{l, p\}$ to the set $\mathcal{S}$, which results in a yield of 1. Observe that all $\Delta$ values must
be non-negative, since we are initializing the tree. Because $p$ is adjacent to at most 1 leaf ($p$
itself can be a leaf if $T = K_2$) we have $\Delta C_{T'} \le 2$ which satisfies the incremental inequality.

■ Figure 4 Illustrations of some augmentation operations. The subtree $T'$ is visualized by drawing
the edges in $E(T) \cap E(T')$ using thick lines, whereas the edges $E(T) \setminus E(T')$ are thin lines. (a)
Tree $T$ in the forest $F$. (b) Initialized subtree $T'$ by Operation 1 around $p$, which involves adding a
conflict structure of Type (1) on vertices $\{p\} \cup (N_T(p) \cap \mathrm{Leaves}(T)) = \{p, a, b, c\}$. Vertices $\{d, e\}$ are
now open branches of $T'$ and $\{a, b, c\}$ are closed branches. $\Delta C_{T'} = 3$, $\Delta O_{T'} = 2$ and $\Delta Y_{\mathcal{S}} = 2$. (c)
The vertex $v$ is an open branch of $T'$, with its neighbor $u$ outside $T'$. (d) Added $\{u\}$ to $T'$ through
Operation 4, which involves adding a conflict structure of Type (2) on vertices $\{u, v\}$ for yield 1.
$\Delta C_{T'} = 1$, $\Delta O_{T'} = -1$ and $\Delta Y_{\mathcal{S}} = 1$. (e) Result of considering the next open branch (re-labeled
$v$) with its neighbor $u$, and adding $\{u\}$ to $T'$ through Operation 4. $\Delta C_{T'} = \Delta O_{T'} = \Delta Y_{\mathcal{S}} = 0$. (f)
Result of applying the first case of Operation 11 to the vertex drawn as $v$ in this picture, which
became an open branch after the previous operation. This operation involved adding a conflict
structure of Type (3) on vertices $\{v, u, t, w\}$ for yield 1. $\Delta C_{T'} = 2$, $\Delta O_{T'} = 0$, $\Delta Y_{\mathcal{S}} = 1$. (g) After
applying Operation 4 for a final time, again adding a conflict structure of Type (2) for yield 1, the
process is complete since $T' = T$. $\Delta C_{T'} = 1$, $\Delta O_{T'} = -1$ and $\Delta Y_{\mathcal{S}} = 1$.

► Operation 3. If both of the previous operations are not applicable then every vertex in $T$
is adjacent to at most 1 leaf, and no leaf of $T$ is $X$-blockable with its parent; this implies by
Observation 3 that all leaves of $T$ are adjacent to vertices of degree $\ge 3$. Consider a leaf $l$
with $N_T(l) = \{p\}$ and initialize $T' := N_T[p]$. It follows that $\deg_T(p) \ge 3$, and since $l$ is the
only neighbor of $p$ which is a leaf we know that we have created at least 2 open branches
and exactly 1 closed branch.

Since we assume $|V(T)| > 1$ one of the three given operations must be applicable to any
tree $T$.
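The case distinction between the three initialization operations can be summarized in code. The following Python sketch is a simplified illustration: the predicate `is_blockable` is a hypothetical stand-in for $X$-blockability of a leaf together with its parent (its definition lies outside this section), and the function only reports which operation applies together with the vertex set $N_T[p]$ of the initial subtree; it does not record the conflict structures.

```python
def choose_initialization(adj, is_blockable):
    """Return (operation_number, vertices_of_initial_subtree) for a tree with >= 2 vertices.

    adj          -- {vertex: set_of_neighbours} describing the tree T
    is_blockable -- hypothetical predicate is_blockable(leaf, parent)
    """
    leaf_set = {v for v, nbrs in adj.items() if len(nbrs) == 1}

    # Operation 1: some vertex p is adjacent to at least two leaves.
    for p, nbrs in adj.items():
        if len(nbrs & leaf_set) >= 2:
            return 1, {p} | nbrs

    # Operation 2: some leaf forms a blockable pair with its parent.
    for leaf in leaf_set:
        (p,) = adj[leaf]
        if is_blockable(leaf, p):
            return 2, {p} | adj[p]

    # Operation 3: fall back to the closed neighbourhood of any leaf's parent.
    leaf = next(iter(leaf_set))
    (p,) = adj[leaf]
    return 3, {p} | adj[p]


star = {"p": {"a", "b", "c"}, "a": {"p"}, "b": {"p"}, "c": {"p"}}
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
print(choose_initialization(star, lambda leaf, parent: False)[0])
print(choose_initialization(path, lambda leaf, parent: True)[0])
```

Because every tree on at least two vertices has a leaf, the fall-through to Operation 3 always finds a vertex to start from, mirroring the remark that one of the three operations is applicable.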

Augmentation. We now show that if the subtree $T'$ and set $\mathcal{S}$ are initialized and $T' \neq T$
then we can augment $T'$ by adding more vertices to it, and possibly adding more conflict
structures to $\mathcal{S}$, while respecting the incremental inequality. If $T' \neq T$ then there is some
open branch of $T'$, i.e. there is a vertex $v \in \mathrm{Leaves}(T') \setminus \mathrm{Leaves}(T)$. We show how to
augment the tree $T'$ near this vertex $v$. We say that a vertex $u \in N_T(v) \setminus V(T')$ is a neighbor
of $v$ outside $T'$, and a vertex $u \in N_T(v) \cap V(T')$ is a neighbor inside $T'$. Since we initialize
tree $T'$ to contain at least two vertices, we know that $v$ has exactly one neighbor inside $T'$.
It can be verified from the description of the initialization procedure that for this vertex $v$
its single neighbor inside $T'$ cannot be a leaf of $T$ or $T'$; this fact will be used later on.

When describing the augmentation steps of the subtree $T'$ we will use $T'_0$ to refer to the
status of the tree before the augmentation, and $T'_1$ to refer to its status after the augmentation.
When the intended meaning is clear from the context we will just write $T'$. Now consider an
open branch vertex $v$ of tree $T'$ as described above.

► Operation 4. If $\deg_T(v) = 2$, then consider the unique neighbor $u$ of $v$ outside $T'$.
If $u \notin \mathrm{Leaves}(T)$ then add $\{u\}$ to $T'$; the numbers of open and closed branches do not change.
If $u \in \mathrm{Leaves}(T)$ then add $\{u\}$ to $T'$ and add the conflict structure $S$ on vertices $\{v, u\}$ of
Type (2) to the set $\mathcal{S}$. By Observation 3 the pair $\{v, u\}$ must be $X$-blockable and therefore $S$
is a valid conflict structure of yield 1. We have $\Delta O_{T'} = -1$, $\Delta C_{T'} = 1$ and $\Delta Y_{\mathcal{S}} = 1$.

So in the remaining cases the open branch of $T'$ at vertex $v$ has $\deg_T(v) \ge 3$.

► Operation 5. If $v$ is adjacent to at least 2 leaves of $T$ then add $N_T(v)$ to $T'$ and add
a conflict structure of Type (1) on vertices $\{v\} \cup (N_T(v) \cap \mathrm{Leaves}(T))$ to $\mathcal{S}$. Since the
neighbor of $v$ which is inside $T'_0$ cannot be a leaf of $T$ or $T'$ by the definition of the
initialization, all leaf-neighbors of $v$ are not in $T'_0$ but are added to the tree to obtain $T'_1$.
Let us consider how the number of open branches is affected by this operation. Vertex $v$
is no longer an open branch vertex in $T'_1$, but its non-leaf neighbors which are not in $T'_0$
have become open branch vertices. So we find that $\Delta O_{T'} = |N_T(v) \setminus \mathrm{Leaves}(T)| - 2$,
$\Delta C_{T'} = |N_T(v) \cap \mathrm{Leaves}(T)|$ and $\Delta Y_{\mathcal{S}} = |N_T(v) \cap \mathrm{Leaves}(T)| - 1$, which satisfies the
incremental inequality since $|N_T(v) \cap \mathrm{Leaves}(T)| \ge 2$.

In the remaining cases we know that v is adjacent to at most 1 leaf of T. 

► Operation 6. If $\deg_T(v) \ge 4$ then add $N_T(v)$ to $T'$. We find that $\Delta C_{T'} \le 1$, $\Delta O_{T'} =
|N_T(v) \setminus \mathrm{Leaves}(T)| - 2$ and $\Delta Y_{\mathcal{S}} = 0$, which satisfies the incremental inequality since
$|N_T(v) \setminus \mathrm{Leaves}(T)| \ge 3$.

In the remaining situations we have $\deg_T(v) = 3$ and $v$ is adjacent to at most 1 leaf of $T$.

► Operation 7. If $v$ is not adjacent to any leaves of $T$, then add $N_T(v)$ to $T'$. We have
$\Delta C_{T'} = 0$, $\Delta O_{T'} \ge 0$ and $\Delta Y_{\mathcal{S}} = 0$.

If none of the cases above hold then we know that $\deg_T(v) = 3$ and $v$ is adjacent to
exactly 1 leaf $l \in \mathrm{Leaves}(T)$. From the definition of the initialization procedure we know
that $l$ is a neighbor of $v$ outside $T'$. Hence vertex $v$ has exactly one non-leaf neighbor $w$
outside $T'$. Consider the path $v, w, \ldots$ in $T$ obtained by following degree-2 vertices until the
first vertex $u$ of degree $\neq 2$ is encountered; this implies that all the vertices which are in
the interior of the unique path $P_{uv}$ between $u, v$ in $T$ have degree 2 in $T$, the endpoints $u, v$
of $P_{uv}$ have degree $\neq 2$ in $T$, and $v$ is the only vertex on this path which is in $T'$. Observe
that we could have $u = w$ if $w$ is a neighbor of $v$ outside $T'$ of degree at least 3.

► Operation 8. If $\deg_T(u) = 1$ then the path $P_{uv}$ must contain at least one vertex of degree 2
in its interior, by the assumption that $v$ is adjacent to only 1 leaf and we started tracing the
path in the direction of the non-leaf neighbor $w$ of $v$. Let $\{p\} = N_T(u)$ be the predecessor
of $u$ on $P_{uv}$. Add $N_T(v)$ and all vertices on $P_{uv}$ to the tree $T'$, and add the conflict structure
of Type (2) on vertices $\{p, u\}$ to $\mathcal{S}$ for a yield of 1. If we consider this augmentation
operation in its entirety then the leaf-neighbor $l$ of $v$ and the vertex $u$ become closed branches
of $T'$. The number of open branches decreases by 1 since vertex $v$ becomes an internal node
in $T'_1$. Hence we find that $\Delta C_{T'} = 2$, $\Delta O_{T'} = -1$ and $\Delta Y_{\mathcal{S}} = 1$.

In the remainder we may assume that $\deg_T(u) \ge 3$.

► Operation 9. If $u$ is adjacent to at least 2 leaves of $T$, then these must be neighbors
outside $T'$ by the structure of the initialization. Add $N_T(v)$, all vertices on the path $P_{uv}$
and the vertices $N_T(u)$ to the tree $T'$, and create a conflict structure of Type (1) on the
vertices $\{u\} \cup (N_T(u) \cap \mathrm{Leaves}(T))$ for a total yield of $|N_T(u) \cap \mathrm{Leaves}(T)| - 1$. For
this operation we find that $\Delta O_{T'} \ge -1$, $\Delta C_{T'} = |N_T(u) \cap \mathrm{Leaves}(T)| + 1$ and $\Delta Y_{\mathcal{S}} =
|N_T(u) \cap \mathrm{Leaves}(T)| - 1$.

In the remainder we know that u is adjacent to at most 1 leaf of T. 

► Operation 10. If $u$ is not adjacent to any leaves or $\deg_T(u) \ge 4$ then add all vertices
$N_T(v)$, all vertices on $P_{uv}$ and the vertices $N_T(u)$ to $T'$. We find that $\Delta C_{T'} \le 2$, $\Delta O_{T'} =
|N_T(u) \setminus \mathrm{Leaves}(T)| - 2 \ge 1$ and $\Delta Y_{\mathcal{S}} = 0$.

► Operation 11. In the final situation we know that $u$ is adjacent to exactly 1 leaf and
$\deg_T(u) = 3$. We now augment and create a conflict structure based on the number of
degree-2 vertices on the path $P_{uv}$ between $u$ and $v$ in $T$.

■ If there are no degree-2 vertices between $u$ and $v$ (i.e. vertices $u$ and $v$ are adjacent in $T$)
then we add all vertices $N_T(v)$ and $N_T(u)$ to $T'$. Consider the unique leaf $w$ adjacent
to $v$ and the leaf $t$ adjacent to $u$. By Observation 3 we know that at least one of the
pairs $\{v, w\}$, $\{u, t\}$, $\{t, w\}$ must be $X$-blockable, and hence we can add a conflict structure
of Type (3) to $\mathcal{S}$ for a yield of 1. For this augmentation we find $\Delta C_{T'} = 2$ since the
leaves $t, w$ are added to $T'$, we have $\Delta O_{T'} = 0$ because one neighbor of $u$ will be a leaf
in $T'_1$ but not in $T$ (which compensates for the fact that $v$ ceases to be an open branch),
and $\Delta Y_{\mathcal{S}} = 1$.

■ If there is exactly 1 degree-2 vertex between $u$ and $v$ then call this vertex $w$. Add
all vertices $N_T(v)$, all vertices on $P_{uv}$ and the set $N_T(u)$ to $T'$. Let $t$ be the unique
leaf adjacent to $u$ and let $x$ be the unique leaf adjacent to $v$. By Observation 3 both
vertices $t$ and $x$ are $X$-blockable, which means we may add a conflict structure of Type (4)
consisting of vertices $t, u, w, v, x$ to $\mathcal{S}$ for a total yield of 1. We have $\Delta C_{T'} = 2$, $\Delta O_{T'} = 0$
and $\Delta Y_{\mathcal{S}} = 1$ which suffices.

■ In the remaining situations there must be at least two degree-2 vertices between $u$ and $v$ on
the path $P_{uv}$. Let $x, y$ be the first two degree-2 vertices succeeding $v$ on this path, which
means that $N_T(x) = \{v, y\}$. We add the vertices $N_T(v) \cup \{x, y\} \cup N_T(y)$ to $T'$. Since $x, y$
have degree 2 in $T$ we know from Observation 3 that the pair $\{x, y\}$ is $X$-blockable.
Therefore we may add a conflict structure of Type (5): let $l$ be the unique leaf of $T$
adjacent to $v$, and add the conflict structure on vertices $\{l, v, x, y\}$ to $\mathcal{S}$ for a yield of 1.
We find that $\Delta C_{T'} = 1$, $\Delta O_{T'} = 0$ and $\Delta Y_{\mathcal{S}} = 1$.
This concludes the description of the augmentation operations. Although we have not
explicitly argued that each operation maintains the invariant that for all conflict structures $S$
that we add to $\mathcal{S}$ we have $V(S) \subseteq V(T') \setminus (\mathrm{Leaves}(T') \setminus \mathrm{Leaves}(T))$, this is straightforward
to verify. From this invariant it also follows that the resulting set of conflict structures is
vertex-disjoint: the conflict structures that we add consist of vertices which were either
open branches of $T'$ before the augmentation or were not yet contained in $T'$ prior to the
augmentation. Because the case distinction is exhaustive we have shown that whenever $T'$ is
not yet equal to $T$ we can augment the tree $T'$ while respecting the incremental inequality. By
the argument given above this proves that the resulting set of conflict structures $\mathcal{S}$ satisfies
$\sum_{S \in \mathcal{S}} \mathrm{Yield}(S) \ge \frac{1}{5}|\mathrm{Leaves}(T)|$. ◀

► Lemma 27. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be a
tree in $F$. Then $\mathrm{Active}_T(X) \ge \frac{1}{5}|\mathrm{Leaves}(T)|$.



Proof. Consider such a tree $T$. If $|V(T)| = 1$ then it follows from Observation 3 that there
is an independent subset $X' \subseteq X$ with $|X'| \le 2$ such that $\mathrm{Conf}_T(X') > 0$, from which it
follows that $\mathrm{Active}_T(X) \ge 1 \ge \frac{1}{5}|\mathrm{Leaves}(T)| = \frac{1}{5}$.

If $|V(T)| \ge 2$ then apply Lemma 26 to tree $T$ to obtain a set $\mathcal{S}$ of vertex-disjoint conflict
structures in $T$ with a total yield of at least $\frac{1}{5}|\mathrm{Leaves}(T)|$. We apply Lemma 25 with this
set $\mathcal{S}$, which proves that $\mathrm{Active}_T(X) \ge \sum_{S \in \mathcal{S}} \mathrm{Yield}(S) \ge \frac{1}{5}|\mathrm{Leaves}(T)|$. ◀


B.6 Putting It Together 

In this section we use the results from the previous sections to obtain a proof of Lemma 8.

► Lemma 28. If $T$ is a tree then for all constants $0 < c < 1$ either $\sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)| \geq c|V(T)|$ or $|\textsc{Leaves}(T)| \geq (1 - c)\frac{1}{16}|V(T)|$.

Proof. Let $T$ be a tree. We will show that $\sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)| < c|V(T)|$ implies that $|\textsc{Leaves}(T)| > (1 - c)\frac{1}{16}|V(T)|$. Consider the tree $T'$ obtained from $T$ by repeatedly deleting a degree-2 vertex $w$ with $N(w) = \{u, v\}$ and replacing it by a direct edge $\{u, v\}$. From this definition it follows that $\textsc{Leaves}(T) = \textsc{Leaves}(T')$ and $V_{\geq 3}(T) = V_{\geq 3}(T')$. Since $T'$ does not contain any degree-2 vertices we know that $V(T') = \textsc{Leaves}(T') \cup V_{\geq 3}(T')$. For a counting argument we partition the edge set of $T'$ into $E_{\geq 8}$ and $E_{<8}$, based on how many degree-2 vertices were deleted to form the edge.

An edge $\{u, v\} \in E(T')$ belongs to the set $E_{\geq 8}$ if the unique path between $u$ and $v$ in $T$ contains at least 8 vertices of degree 2, and it belongs to $E_{<8}$ otherwise. Let $P_T(u, v)$ for $\{u, v\} \subseteq V(T)$ denote the unique simple path in $T$ between $u$ and $v$, and let $V(P_T(u, v))$ be the vertices on that path, including the endpoints. Now define:

$V(E_{\geq 8}) := \{w \in V(T) \mid \deg_T(w) = 2 \wedge \exists \{u, v\} \in E_{\geq 8} \text{ s.t. } w \in V(P_T(u, v))\}$
$V(E_{<8}) := \{w \in V(T) \mid \deg_T(w) = 2 \wedge \exists \{u, v\} \in E_{<8} \text{ s.t. } w \in V(P_T(u, v))\}$.

Since every degree-2 vertex in $T$ must lie on the interior of some path $P_T(u, v)$ for vertices $u, v$ of degree unequal to 2, it follows that the sets $V(E_{\geq 8})$ and $V(E_{<8})$ partition the set of degree-2 vertices in $T$. Hence the vertex set of $T$ is partitioned into $\textsc{Leaves}(T)$, $V_{\geq 3}(T)$, $V(E_{\geq 8})$ and $V(E_{<8})$. Since all the vertices in $V(E_{\geq 8})$ lie on a path consisting of at least 8 degree-2 vertices, we know that all paths containing these vertices are in the set $\mathcal{P}_{\geq 8}(T)$ and therefore $|V(E_{\geq 8})| \leq \sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)|$. Since every edge in $E_{<8}$ is the result of deleting at most 7 degree-2 vertices we have $|V(E_{<8})| \leq 7|E(T')|$. We now have sufficient information to complete the proof.
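\begin{align*}
|V(T)| &= |\textsc{Leaves}(T)| + |V_{\geq 3}(T)| + |V(E_{\geq 8})| + |V(E_{<8})| && \text{By the partition of } E(T'). \\
&= |\textsc{Leaves}(T')| + |V_{\geq 3}(T')| + |V(E_{\geq 8})| + |V(E_{<8})| && \text{Correspondence $T'$ and $T$.} \\
&\leq |\textsc{Leaves}(T')| + |\textsc{Leaves}(T')| + |V(E_{\geq 8})| + |V(E_{<8})| && \text{By Proposition 2 on $T'$.} \\
&\leq 2|\textsc{Leaves}(T')| + |V(E_{\geq 8})| + 7|E(T')| && \text{By observations above.} \\
&\leq 2|\textsc{Leaves}(T')| + |V(E_{\geq 8})| + 7(2|\textsc{Leaves}(T')|) && \text{By Proposition 2.} \\
&\leq 16|\textsc{Leaves}(T')| + |V(E_{\geq 8})| && \text{Simplifying.} \\
&\leq 16|\textsc{Leaves}(T')| + \sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)| && \text{By observations above.} \\
&< 16|\textsc{Leaves}(T')| + c|V(T)| && \text{By initial assumption.} \\
&= 16|\textsc{Leaves}(T)| + c|V(T)| && \textsc{Leaves}(T) = \textsc{Leaves}(T').
\end{align*}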



The claim follows from this inequality by simple formula manipulation. 
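
The counting argument can be sanity-checked numerically. The following Python sketch is not part of the original proof: it assumes that the paths counted in the lemma are the maximal runs of at least 8 consecutive degree-2 vertices, and the function names `lemma28_quantities`, `dichotomy_holds` and `random_tree` are hypothetical helpers introduced only for illustration. It computes the three quantities appearing in the final inequality of the proof for a tree on vertices 0..n-1.

```python
import random
from collections import defaultdict


def lemma28_quantities(n, edges):
    """Return (|V(T)|, |Leaves(T)|, vertices on long degree-2 paths).

    The tree T has vertices 0..n-1. A "long path" is taken to be a maximal
    run of at least 8 consecutive degree-2 vertices (an assumed reading).
    """
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    leaves = sum(1 for v in range(n) if len(adj[v]) == 1)
    degree_two = {v for v in range(n) if len(adj[v]) == 2}
    seen, long_total = set(), 0
    # Maximal degree-2 paths are the connected components of the subgraph
    # induced by the degree-2 vertices.
    for start in degree_two:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:
            v = stack.pop()
            size += 1
            for w in adj[v]:
                if w in degree_two and w not in seen:
                    seen.add(w)
                    stack.append(w)
        if size >= 8:
            long_total += size
    return n, leaves, long_total


def dichotomy_holds(n, edges, c):
    """Check the two alternatives of the lemma for a constant 0 < c < 1."""
    size, leaves, long_total = lemma28_quantities(n, edges)
    return long_total >= c * size or leaves >= (1 - c) * size / 16


def random_tree(n, rng):
    """Attach every vertex i >= 1 to a uniformly random earlier vertex."""
    return [(i, rng.randrange(i)) for i in range(1, n)]
```

On a path with 12 vertices the sketch reports 2 leaves and 10 vertices on long degree-2 paths, so the first alternative of the lemma holds for every $c \leq 10/12$.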



► Lemma 29. Let $(G, X, k)$ be a reduced instance with forest $F := G - X$, and let $T$ be a tree in $F$. Then $\textsc{Active}_T(X) \geq \frac{1}{83}|V(T)|$.

Proof. Consider such a tree $T$ in the forest $F$.

■ If $\sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)| \geq \frac{3}{83}|V(T)|$ then by applying Lemma 23 we find that $\textsc{Active}_T(X) \geq \frac{1}{3}\sum_{P \in \mathcal{P}_{\geq 8}(T)} |V(P)| \geq \frac{1}{83}|V(T)|$.

■ If $|\textsc{Leaves}(T)| \geq (1 - \frac{3}{83})\frac{1}{16}|V(T)|$ then by applying Lemma 27 we find that $\textsc{Active}_T(X) \geq \frac{1}{5}|\textsc{Leaves}(T)| \geq \frac{1}{5}(1 - \frac{3}{83})\frac{1}{16}|V(T)| = \frac{1}{83}|V(T)|$.

Since Lemma 28 shows that one of the two cases above must hold, this concludes the proof. ◄

► Lemma 30. Let $X' \subseteq X$, and let $\mathcal{C}(F)$ denote the connected components of $F$. The following must hold:

1. $\textsc{Conf}_F(X') = \sum_{T \in \mathcal{C}(F)} \textsc{Conf}_T(X')$.

2. $\textsc{Active}_F(X') = \sum_{T \in \mathcal{C}(F)} \textsc{Active}_T(X')$.

Proof. The first claim follows from the fact that a maximum independent set in a forest is simply the union of maximum independent sets for the trees in the forest. The second claim follows directly from the first by the definition of active conflicts. ◄
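
The fact used in the first claim, that a maximum independent set of a forest decomposes over its trees, can be illustrated with a short Python sketch. It is not taken from the paper: the function names are hypothetical, and the per-tree computation is the standard take/skip dynamic program on rooted trees.

```python
from collections import defaultdict
from itertools import combinations


def tree_independence_number(root, adj):
    """Return (independence number, vertex set) of the tree containing root."""
    parent, order, stack = {root: None}, [], [root]
    while stack:
        v = stack.pop()
        order.append(v)
        for w in adj[v]:
            if w not in parent:
                parent[w] = v
                stack.append(w)
    take = {v: 1 for v in order}   # best value when v is in the set
    skip = {v: 0 for v in order}   # best value when v is not in the set
    for v in reversed(order):      # children are processed before parents
        p = parent[v]
        if p is not None:
            take[p] += skip[v]
            skip[p] += max(take[v], skip[v])
    return max(take[root], skip[root]), set(order)


def forest_independence_number(vertices, edges):
    """Sum the independence numbers of the trees (components) of a forest."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, total = set(), 0
    for v in vertices:
        if v not in seen:
            alpha, component = tree_independence_number(v, adj)
            seen |= component
            total += alpha
    return total


def brute_force_independence_number(vertices, edges):
    """Exponential-time reference implementation for small inputs."""
    for size in range(len(vertices), -1, -1):
        for subset in combinations(vertices, size):
            chosen = set(subset)
            if not any(u in chosen and v in chosen for u, v in edges):
                return size
```

For the forest consisting of the path 0-1-2, the edge 3-4 and the isolated vertex 5, the per-tree values are 2, 1 and 1, and their sum agrees with the brute-force value 4.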



► Lemma 31 (Restatement of Lemma 8). Let $(G, X, k)$ be a reduced instance of FVS-Independent Set with forest $F := G - X$. Then $|V(G)| \leq |X| + 83|X|^3$.

Proof. Assume the conditions in the statement of the lemma hold. The vertex set of $G$ consists of $X \cup V(F)$. The important step is to bound $|V(F)|$ in terms of $|X|$. Let $\mathcal{C}(F)$ denote the connected components of $F$. We now have:

\begin{align*}
\textsc{Active}_F(X) &= \sum_{T \in \mathcal{C}(F)} \textsc{Active}_T(X) && \text{By Lemma 30.} \\
&\geq \sum_{T \in \mathcal{C}(F)} \frac{1}{83}|V(T)| && \text{By Lemma 29.} \\
&= \frac{1}{83}|V(F)|. && \text{Since $F$ is the union of its components.}
\end{align*}

Thus $|V(F)| \leq 83 \cdot \textsc{Active}_F(X)$, which implies that $|V(F)| \leq 83|X|^3$ by Observation 4. Since $|V(G)| = |X| + |V(F)|$ we have now shown that $|V(G)| \leq |X| + 83|X|^3$. ◄



B.7 The kernelization algorithm 

We now give the proof of Theorem 9, which was omitted from the main text.

Proof of Theorem 9. We sketch the algorithm and then prove that it is correct. When given an input $(G, X, k)$ the algorithm exhaustively applies the reduction rules, until these no longer change the graph. Each reduction rule can be applied and tested for applicability in polynomial time. For Rule 1 this follows from Theorem 1, and for the other reduction rules this is easy to verify since it involves finding and manipulating bounded-size bounded-degree substructures in the graph, and computing the independence numbers of forests. Each application of a reduction rule either deletes vertices and their incident edges, or adds edges to the graph. Since we can delete at most $|V(G)|$ vertices, and since we can add only $O(|V(G)|^2)$ edges, the procedure must stop after a polynomial number of applications of the reduction rules. The resulting instance is output as $(G', X', k')$ if $k' > 0$; otherwise we output a constant-size YES-instance. The reduction rules do not increase the values $|X|$, $k$ or $|V(G)| - k$, which proves the bounds on $|X'|$, $k'$ and $|V(G')| - k'$. The fact that $|V(G')| \leq 2(|V(G')| - k')$ follows from Theorem 1, whereas the bound $|V(G')| \leq |X| + 83|X|^3$ follows directly from Lemma 8. The correctness proofs of the reduction rules show that $(G, X, k)$ and $(G', X', k')$ are indeed equivalent instances. ◄
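
The control flow of such an exhaustive-application procedure can be sketched in Python. This is a minimal illustration under stated assumptions, not the paper's algorithm: the instance is simplified to a pair (adjacency dictionary, k) without the set $X$, and `isolated_vertex_rule` is a toy rule for Independent Set that is not one of the paper's reduction rules.

```python
def kernelize(instance, rules):
    """Apply the first applicable rule repeatedly until no rule applies.

    Each rule takes an instance (adj, k) and returns a reduced equivalent
    instance, or None when it is not applicable.
    """
    changed = True
    while changed:
        changed = False
        for rule in rules:
            reduced = rule(instance)
            if reduced is not None:
                instance = reduced
                changed = True
                break
    adj, k = instance
    if k <= 0:
        # An independent set of size <= 0 always exists, so output a
        # constant-size YES-instance: the empty graph with target 0.
        return {}, 0
    return adj, k


def isolated_vertex_rule(instance):
    """Toy rule (an assumption, not from the paper): an isolated vertex
    belongs to some maximum independent set, so delete it and decrease k."""
    adj, k = instance
    for v, neighbors in adj.items():
        if not neighbors:
            rest = {u: set(ns) for u, ns in adj.items() if u != v}
            return rest, k - 1
    return None
```

Every application of the toy rule deletes a vertex, so the loop stops after at most $|V(G)|$ iterations, which mirrors the termination argument of the proof.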

Using the previous proof, Corollary 10 follows easily from the dual nature of Independent Set and Vertex Cover.

Proof of Corollary 10. Given an instance $(G, X, k)$ of FVS-Vertex Cover we transform it into an instance $(G, X, |V(G)| - k)$ of FVS-Independent Set, which is an equivalent instance because the complement of an independent set is a vertex cover. We apply the kernelization algorithm from Theorem 9 to $(G, X, |V(G)| - k)$ to compute in polynomial time an equivalent instance $(G', X', |V(G')| - k')$. By adjusting the target value we transform this back to an instance $(G', X', k')$ of FVS-Vertex Cover and use it as the output, which shows that $|V(G')| \leq 2k'$ and $|V(G')| \leq |X'| + 83|X'|^3$. ◄
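
The duality used in this proof can be checked by brute force on small graphs. The Python sketch below is an illustration only; the helper names are hypothetical and the running time is exponential.

```python
from itertools import combinations


def has_vertex_cover(vertices, edges, k):
    """True if some set of at most k vertices touches every edge."""
    if k < 0:
        return False
    size = min(k, len(vertices))  # supersets of a cover are covers
    return any(
        all(u in cover or v in cover for u, v in edges)
        for cover in map(set, combinations(vertices, size))
    )


def has_independent_set(vertices, edges, k):
    """True if some set of at least k vertices spans no edge."""
    size = max(k, 0)  # subsets of an independent set are independent
    return any(
        not any(u in chosen and v in chosen for u, v in edges)
        for chosen in map(set, combinations(vertices, size))
    )


def to_independent_set_target(vertices, k):
    """Target value of the equivalent Independent Set instance."""
    return len(vertices) - k
```

On the cycle with five vertices, for example, a vertex cover of size at most 3 exists exactly because an independent set of size at least 2 exists.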



Omitted Proofs: No Polynomial Kernel for FVS-Weighted 
Independent Set 

► Lemma 32. Paired Vector Agreement is NP-complete. 

Proof. It is trivial to see that Paired Vector Agreement is in NP. To prove it is NP-hard, we give a polynomial-time many-one reduction from Vertex Cover. Let $(G, k)$ be an instance of Vertex Cover on $n$ vertices; we construct an equivalent instance $(L', t', m', k')$ of Paired Vector Agreement. For simplicity we identify the vertices of $G$ with the numbers $[1 \ldots n]$. We may assume that the target size $k$ for the vertex cover is less than $n$, otherwise it is a trivial YES-instance.

We now describe the vector pairs in the Paired Vector Agreement instance. Each vector pair corresponds to a vertex, so $t' := n$. The first vector of a pair represents the choice of taking the vertex in the vertex cover, and the second vector represents the choice to avoid it. We set $k' := k$. There are vectors $(a^i, b^i)$ for $1 \leq i \leq n$. The vectors in the list $L'$ have length $m' := n^2 + n$ each.

■ Vector $a^i$ has a # at position $n^2 + i$, and ? elsewhere.

■ Vector $b^i$ has a 0 at positions $[(i-1)n + 1 \ldots (i-1)n + n]$, has a 1 at all positions $[(j-1)n + 1 \ldots (j-1)n + n]$ for which vertex $j$ is a neighbor in $G$ of vertex $i$, and has ? elsewhere.

This concludes the description of the Paired Vector Agreement instance $(L', t', m', k')$. It is trivial to see that the instance can be constructed from $(G, k)$ in polynomial time. We now prove that the two instances are equivalent: we can select one vector from each pair for a total of at most $k$ conflict positions $\Leftrightarrow$ graph $G$ has a vertex cover of size at most $k$. We prove the two directions separately.

($\Rightarrow$) Suppose that we can select one vector from each pair such that the chosen vectors induce at most $k$ conflict positions. Look at two vertices $i, j$ which are adjacent in $G$, and consider the effect of choosing the $b$-vectors $b^i$ and $b^j$ for both these vertices. By the definition of the $b$-vectors we know that all positions $[(i-1)n + 1 \ldots (i-1)n + n]$ are a 0 for vector $b^i$, but since $j$ is adjacent to $i$ these positions are set to 1 in vector $b^j$; hence if we choose $b^i$ and $b^j$ simultaneously then all these positions are conflict positions. Similarly all positions $[(j-1)n + 1 \ldots (j-1)n + n]$ must then be conflict positions. This shows that if we choose the $b$-vectors for two adjacent vertices, then this induces at least $2n$ conflicts. By our assumption that $k < n$ this shows that whenever the chosen vectors induce at most $k$ conflict positions, the $a$-vectors which are chosen must form a vertex cover of $G$: if we take the $b$-vectors for adjacent vertices then this induces more than $k$ conflict positions. Every $a$-vector, say $a^i$, that is chosen induces a unique conflict position through the # symbol at position $n^2 + i$, and therefore a choice of vectors with at most $k$ conflict positions contains at most $k$ vectors of type $a$. Since these vectors form a vertex cover, this proves that $G$ has a vertex cover of size at most $k$.

($\Leftarrow$) Now suppose that graph $G$ has a vertex cover of size at most $k$. By the structure of the Paired Vector Agreement instance described previously, it follows that if we take the $a$-vectors of all vertices in the cover, and the $b$-vectors for all other vertices, then we have a vector set which induces at most $k$ conflicts: one conflict for each chosen $a$-vector, and none among the chosen $b$-vectors because the vertices outside the cover are pairwise non-adjacent. This concludes the proof. ◄
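
The reduction can be made concrete with a small Python sketch. It is an illustration under assumptions rather than the paper's implementation: vectors are represented as strings over the symbols `0`, `1`, `?` and `#`, a position is counted as a conflict when the chosen vectors contain a `#` there or both a `0` and a `1`, and all function names are hypothetical.

```python
from itertools import product


def conflicts(vectors):
    """Count positions holding a '#' or both a '0' and a '1'."""
    count = 0
    for column in zip(*vectors):
        symbols = set(column)
        if '#' in symbols or {'0', '1'} <= symbols:
            count += 1
    return count


def vertex_cover_to_pva(n, edges):
    """Build the pairs (a^i, b^i) for a graph on the vertices 1..n."""
    neighbors = {i: set() for i in range(1, n + 1)}
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    m = n * n + n
    pairs = []
    for i in range(1, n + 1):
        a = ['?'] * m
        a[n * n + i - 1] = '#'                 # position n^2 + i, 1-based
        b = ['?'] * m
        for j in range(1, n + 1):
            if j == i:
                symbol = '0'                   # own block
            elif j in neighbors[i]:
                symbol = '1'                   # block of a neighbor
            else:
                continue
            b[(j - 1) * n:j * n] = [symbol] * n
        pairs.append((''.join(a), ''.join(b)))
    return pairs


def min_conflicts(pairs):
    """Brute force over all ways to pick one vector from each pair."""
    return min(conflicts(choice) for choice in product(*pairs))
```

For a triangle the minimum number of conflicts found by `min_conflicts` is 2, and for the path 1-2-3 it is 1, matching the sizes of the minimum vertex covers of these graphs.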

► Lemma 33. $t$-Paired Vector Agreement is OR-compositional: there is an algorithm that receives $s$ instances of $t$-Paired Vector Agreement with equal value $t$ of the parameter, and outputs an instance $(L^*, t^*, m^*, k^*)$ in time polynomial in the total size of the input instances, such that $(L^*, t^*, m^*, k^*)$ is a YES-instance if and only if one of the input instances is a YES-instance, and such that $t^* \leq 2t$.

Proof. Suppose we are given $s$ instances of Paired Vector Agreement that all have the same parameter value $t$. It is easy to see that we can solve an instance of $t$-Paired Vector Agreement in $O^*(2^t)$ time by exhaustively trying all ways of choosing one vector from each pair. This means that if $s \geq 2^t$ then the composition algorithm can successively solve each instance exactly, for a total running time of $O^*(s 2^t)$, which is polynomial in the total input size if $s \geq 2^t$. We can then output a constant-sized YES-instance if any of the input instances is a YES-instance, and otherwise output a constant-sized NO-instance. So in the remainder we may assume that $s < 2^t$. We now pad the instances to make it easier to compose them. If the number of input instances $s$ is not a power of 2, then we may duplicate some instances in the sequence of inputs to increase $s$ to the next power of 2. Next we ensure that all instances look for the same number of conflicts $k$. We find the instance with the largest value of $k$, and if there is some other instance $x'$ with $k' < k$ then we append $k - k'$ elements to each vector of instance $x'$, and give each vector a value of # in these positions; this ensures that the original version of $x'$ has a set with at most $k'$ conflict positions if and only if the padded version of $x'$ has a set with at most $k$ conflict positions. As the final step we unify the vector lengths. If some instances use differing vector lengths then we pad vectors by appending ?-entries to the end, until all vectors have the same length. It is easy to verify that these steps do not increase the size of the input by more than a polynomial factor.

So now we may assume that we are given $2^r$ instances $(L_j, t, m, k)$ indexed by $0 \leq j < 2^r$ for some integer $r \leq t$, where all instances use the same values for $t$, $m$ and $k$. We show how to compose all these instances into one new instance $(L^*, t^*, m^*, k^*)$. We have $r + t$ pairs of vectors in our output instance, so $t^* := r + t \leq 2t$. The first $t$ pairs are formed by concatenating vectors from the input instances. Pair $i$ for $1 \leq i \leq t$ is defined as $(a_i^*, b_i^*)$ where $a_i^*$ is obtained by concatenating all vectors $a_i$ from the sets $L_j$ for $0 \leq j < 2^r$, and similarly $b_i^*$ is the concatenation of all vectors $b_i$ from $L_j$ for $0 \leq j < 2^r$. Hence all vectors have length $m^* := 2^r m$. This construction implies that if we restrict the resulting vectors to positions $[jm + 1 \ldots jm + m]$ then we re-obtain the vectors corresponding to instance $j$. It remains to describe the last $r$ pairs of vectors; these will enforce the selection of exactly 1 active instance in any solution set. So we have exactly $r$ instance selection pairs defined as follows:

■ The $a$-vector of the $i$-th instance selection pair has a ? at positions $[qm + 1 \ldots qm + m]$ for which the $i$-th bit in the binary notation of the value $q$ is a 0, and has a # elsewhere.

■ The $b$-vector of the $i$-th instance selection pair has a ? at positions $[qm + 1 \ldots qm + m]$ for which the $i$-th bit in the binary notation of the value $q$ is a 1, and has a # elsewhere.

This gives a complete description of the instance selection pairs. The total vector set for the output instance consists of the union of the $r$ instance selection pairs, and the $t$ pairs that hold concatenated values for the input instances. We define the target number of conflicts as $k^* := (2^r - 1)m + k$, which concludes the description of the output instance. It is not hard to verify that the output instance can be computed in time which is polynomial in the total size of the input instances. The new parameter value $t^*$ is clearly bounded polynomially in the old parameter value $t$ since $t^* \leq 2t$. All that remains is to prove that the answer to the output instance is yes if and only if the answer to one of the input instances is yes. We will separately prove both directions of this equivalence.

($\Leftarrow$) First assume that input instance $j$ (with $0 \leq j < 2^r$) is a YES-instance; we show that the output instance is a YES-instance. Consider the choice of vectors for instance $j$ that induces at most $k$ conflicts. We show how to choose one vector from each pair in the output instance such that the resulting set induces at most $(2^r - 1)m + k$ conflicts. Consider the binary notation of the number $j$, which can be specified by $r$ bits $b_r, b_{r-1}, \ldots, b_1$. We choose vectors from the instance selection pairs according to the values of these bits. For $1 \leq i \leq r$ we choose the $a$-vector of the $i$-th instance selection pair if $b_i = 0$, and we choose the $b$-vector if $b_i = 1$. Now consider the effect of these choices on the conflict positions in the chosen vectors. For all values $j' \neq j$ with $0 \leq j' < 2^r$, there is some bit in which the binary notation of $j'$ differs from that of $j$; since the vector we chose at that position corresponds to the value of $j$ instead of the value of $j'$, this implies that all positions $[j'm + 1 \ldots j'm + m]$ must be conflict positions since we have chosen a vector which has a # there. Since there are $2^r - 1$ such values $j' \neq j$ these choices already induce $(2^r - 1)m$ conflicts, and only the positions $[jm + 1 \ldots jm + m]$ can be conflict-free, because the instance selection vectors that we chose all have ? values there. We have yet to specify a choice of vectors in the pairs containing concatenated input vectors. The choices for these pairs match the choice made in the solution for the YES-instance $j$: if the solution to instance $j$ chooses the $a$-vector of pair $i$, then we also choose the $a$-vector of concatenated pair $i$; otherwise we choose the $b$-vector there. Since the concatenated vectors, when restricted to positions $[jm + 1 \ldots jm + m]$, are equal to the input vectors of instance $j$, these vectors induce at most $k$ conflicts in positions $[jm + 1 \ldots jm + m]$ (by the assumption that we started from a valid solution for YES-instance $j$). Since the instance selection vectors only have ?-elements in the range $[jm + 1 \ldots jm + m]$ these do not induce additional conflicts, and thus the total number of conflicts induced by this choice of vectors is at most $(2^r - 1)m + k$.

($\Rightarrow$) Now assume that the output instance is a YES-instance. As before, the choice of $(a, b)$ vectors in the instance selection pairs corresponds to the binary representation of a number $0 \leq j < 2^r$, and for all $j' \neq j$ with $0 \leq j' < 2^r$ we must have conflicts at positions $[j'm + 1 \ldots j'm + m]$ by the definition of the instance selection vectors. Hence we have $(2^r - 1)m$ conflicts in these positions, and since the answer to the output instance is yes there are at most $k$ conflicts in the positions $[jm + 1 \ldots jm + m]$. But since the concatenated set of vectors restricted to the positions $[jm + 1 \ldots jm + m]$ forms exactly the vector set of instance $j$, it follows that instance $j$ is a YES-instance. This concludes the proof of correctness of the composition algorithm. ◄
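
The composition step can be sketched in Python for inputs that have already been padded. The sketch makes several assumptions that are not in the original text: vectors are strings over `0`, `1`, `?`, `#`, the instances are passed as a list of $2^r$ pair-lists with identical $t$, $m$ and $k$, the $i$-th bit is taken to be bit $i - 1$ counted from the least significant end, and `compose`, `conflicts` and `is_yes` are hypothetical names.

```python
from itertools import product


def conflicts(vectors):
    """Count positions holding a '#' or both a '0' and a '1'."""
    count = 0
    for column in zip(*vectors):
        symbols = set(column)
        if '#' in symbols or {'0', '1'} <= symbols:
            count += 1
    return count


def is_yes(pairs, k):
    """Brute-force test: can one vector per pair give at most k conflicts?"""
    return any(conflicts(choice) <= k for choice in product(*pairs))


def compose(instances, k):
    """Compose 2^r padded instances into one; returns (pairs, k_star)."""
    count = len(instances)
    r = count.bit_length() - 1
    assert count == 2 ** r, "duplicate instances up to a power of two first"
    t = len(instances[0])
    m = len(instances[0][0][0])
    pairs = []
    # The first t pairs concatenate the i-th pair of every input instance.
    for i in range(t):
        a_star = ''.join(inst[i][0] for inst in instances)
        b_star = ''.join(inst[i][1] for inst in instances)
        pairs.append((a_star, b_star))
    # The last r pairs select one block q; all other blocks are filled
    # with '#' by at least one chosen selection vector.
    for bit in range(r):
        a_sel = ''.join(('?' if (q >> bit) & 1 == 0 else '#') * m
                        for q in range(count))
        b_sel = ''.join(('?' if (q >> bit) & 1 == 1 else '#') * m
                        for q in range(count))
        pairs.append((a_sel, b_sel))
    return pairs, (count - 1) * m + k
```

With `yes_inst = [("01", "##"), ("0?", "1?")]` and `no_inst = [("0#", "1#"), ("??", "??")]` and $k = 0$, composing `[no_inst, yes_inst]` gives three pairs of length 4 with target 2, and the brute-force check accepts it, while composing two copies of `no_inst` is rejected.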

► Lemma 34. The instances $(L, t, m, k)$ and $(G', w', X', k')$ in the polynomial parameter transformation of Lemma 11 are equivalent: we can choose one vector out of each pair in $L$ for a total of $\leq k$ induced conflict positions $\Leftrightarrow$ graph $G'$ has an independent set with total weight $\geq k'$.

Proof. We prove the two directions separately. 

($\Rightarrow$) Suppose $S$ is a set containing exactly one vector from each pair in $L$ such that the number of conflict positions induced by $S$ is at most $k$. Now construct an independent set $I$ in $G'$ as follows. Each vector corresponds one-to-one with a vertex in the feedback vertex set $X'$ of the graph $G'$. For each pair $i$, add to $I$ the vertex corresponding to the $a$-vector if $S$ contains the $a$-vector of pair $i$, and the vertex corresponding to the $b$-vector if $S$ contains the $b$-vector of pair $i$. By the adjacency structure of $G'$ the set we obtain in this way is independent, and it has weight exactly $t \cdot 2(t + m) = 2t(t + m)$ since we have chosen one vertex from each pair for a total of $t$ pairs, and all such vertices have weight $2(t + m)$ by definition. Now consider how the selected vertices in $I$ are adjacent to the position vertices $p_j^0$ and $p_j^1$. If the vectors of $S$ have a conflict at position $j$, then our current set $I$ contains neighbors of both $p_j^0$ and $p_j^1$ (either because we have selected a vertex corresponding to a vector with a # at position $j$, or because we have selected one vector with a 0 there and one with a 1). But if the vectors of $S$ have no conflict at position $j$ then for at least one vertex of $\{p_j^0, p_j^1\}$ we have not chosen any of its neighbors in $I$ and we may add such a vertex to $I$. This choice will not cause position vertices for positions $j' \neq j$ to obtain neighbors in $I$, because of the adjacency structure of the graph. This implies that for every position where the vectors of $S$ induce no conflict, we gain an additional vertex of weight 1 in the independent set. Since there are at least $m - k$ such positions, we add at least $m - k$ extra weight to our independent set, which proves that we find an independent set of weight at least $2t(t + m) + (m - k) = k'$. This establishes the claim in this direction.

($\Leftarrow$) Suppose $I$ is an independent set in $G'$ of weight at least $k' = 2t(t + m) + (m - k)$. We first show that $I$ must contain exactly one vertex from each pair of vector vertices, for $1 \leq i \leq t$. Observe that no independent set can contain both vertices of a pair, as the vertices are adjacent in $G'$. Now suppose that we have an independent set for which there is a pair $i$ such that $I$ contains no vertex of the pair. Then this independent set has weight at most $2(t - 1)(t + m) + m$ because it contains at most one vector vertex of weight $2(t + m)$ from each of the remaining $t - 1$ pairs, and at most $m$ position vertices, which all have weight 1. But since we assumed that $k < m$ we then find that $2(t - 1)(t + m) + m < 2t(t + m) + (m - k)$, a contradiction to the assumption that the weight of $I$ is at least $k'$. So we know that $I$ contains exactly one vertex from each pair of vector vertices. Since the remaining vertices have weight 1, the set $I$ must contain at least $m - k$ of these weight-1 position vertices. But as we showed earlier, an independent set in $G'$ can only contain one vertex from a position pair, and only if the vector vertices that are selected do not induce a conflict there. So for the selected vector vertices in $I$ there are at least $m - k$ positions where they do not induce a conflict, which implies that those vectors induce at most $k$ conflicts. This proves the claim. ◄
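
The weight comparison used in the second direction can be written out explicitly. The following worked step is added for illustration and uses only the quantities $t$, $m$, $k$ and the assumption $k < m$ from the proof:

```latex
\begin{align*}
\bigl(2t(t+m) + (m-k)\bigr) - \bigl(2(t-1)(t+m) + m\bigr)
  &= 2(t+m) - k \\
  &> 2(t+m) - m && \text{(using } k < m\text{)} \\
  &= 2t + m > 0 .
\end{align*}
```

Hence an independent set that misses a pair of vector vertices entirely has weight strictly below $k'$.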



